WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification ? : 

C12N 15/31, C07K 14/315, 16/12, G01N 
33/50, A61K 39/09, C12Q 1/68 



A2 



(11) International Publication Number: WO 00/06737 

(43) International Publication Date: 10 February 2000 (10.02.00) 



(21) International Application Number: PCT/GB99/0245 1 

(22) International Filing Date: 27 July 1999 (27.07.99) 



(30) Priority Data: 

9816337.1 
60/125,164 



27 July 1998 (27.07.98) GB 
19 March 1999 (19.03.99) US 



(71) Applicant (for all designated States except US): MICROBIAL 

TECHNICS LIMITED [GB/GB]; 20 Trumpington Street, 
Cambridge CB2 1QA (GB). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): GILBERT, Christophe, 
Francois, Guy [FR/GB]; University of Cambridge, Dept. 
of Pathology, Tennis Court Road, Cambridge CB1 1PQ 
(GB). HANSBRO, Philip, Michael [GB/GB]; University 
of Cambridge, Dept. of Pathology, Tennis Court Road, 
Cambridge CB2 1QP (GB). 

(74) Agents: CHAPMAN, Paul, William et al.; Kilburn & Strode, 
20 Red Lion Street, London WC1R 4PJ (GB). 



(81) Designated States: CN, JP, US, European patent (AT, BE, CH, 
CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, 
PT, SE). 



Published 

Without international search report and to be republished 
upon receipt of that report. 



(54) Title: STREPTOCOCCUS PNEUMONIAE PROTEINS AND NUCLEIC ACID MOLECULES 
(57) Abstract 

Novel protein antigens from Streptococcus pneumoniae are disclosed, together with nucleic acid sequences encoding them. Their use 
in vaccines and in screening methods is also described. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


C6te d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 00/06737 



PCT/GB99/02451 



STREPTOCOCCUS PNEUMONIAE PROTEINS AND NUCLEIC ACID MOLECULES 

The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
5 screening the proteins/nucleic acid sequences as potential anti-microbial targets. 

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an 
important pathogenic organism. The continuing significance of Streptoccocus 
pneumoniae infections in relation to human disease in developing and developed 

10 countries has been authoritatively reviewed (Fiber, G.R., Science, 265: 1385-1387 
(1994)). That indicates that on a global scale this organism is believed to be the 
most common bacterial cause of acute respiratory infections, and is estimated to 
result in 1 million childhood deaths each year, mostly in developing countries 
(Stansfield, S.K., Pediatr. Infect. Dis., 6: 622 (1987)). In the USA it has been 

15 suggested (Breiman et al, Arch. Intern. Med., 150: 1401 (1990)) that the 
pneumococcus is still the most common cause of bacterial pneumonia, and that 
disease rates are particularly high in young children, in the elderly, and in patients 
with (predisposing conditions such as asplenia, heart, lung and kidney disease, 
diabetes, alcoholism, or with immunosupressive disorders, especially AIDS. These 

20 groups are at higher risk of pneumococcal septicaemia and hence meningitis and 
therefore have a greater risk of dying from pneumococcal infection. The 
pneumococcus is also the leading cause of otitis media and sinusitis, which remain 
prevalent infections in children in developed countries, and which incur substantial 
costs. 

25 

The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reported that 6.6% of pneumoccal isolates in 13 US hospitals in 12 states were found 
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to be resistant to penicillin and some isolates were also resistant to other antibiotics 
including third generation cyclosporins (Schappert, S.M., Vital and Health Statistics 
of the Centres for Disease Control/National Centre for Health Statistics, 214:1 
(1992)). The rates of penicillin resistance can be higher (up to 20%) in some 
5 hospitals (Breiman et al, J. Am. Med. Assoc., 271: 1831 (1994)). Since the 
development of penicillin resistance among pneumococci is both recent and sudden, 
coming after decades during which penicillin remained an effective treatment, these 
findings are regarded as alarming. 

10 For the reasons given above, there are therefore compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention 
15 of pneumococcal infections. Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the structure of the polysaccharide capsule 
surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means that vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
20 cases. An additional problem arises because it ahs been found that the capsular 

polysaccharides (each of which determines the serotype and is the major protective 
antigen) when purified and used as a vaccine do not reliably induce protective 
antibody responses in children under two years of age, the age group which suffers 
the highest incidence of invasive pneumococcal infection and meningitis. 

25 

A modification of the approach using capsule antigens relies on conjugating the 
polysaccharide to a protein in order to derive an enhanced immune response, 
particularly by giving the response T-cell dependent character. This approach has 
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been used in the development of a vaccine against Haemophilus influenzae. There 
are issues of cost concerning both the multi-polysaccharide vaccines and those based 
on conjugates. 

5 

A third approach is to look for other antigenic components which offer the potential 
to be vaccine candidates. In the present application we provide a group of proteins 
antigens which are secreted/exported proteins. 

10 Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in table 2 
herein. 

A protein or polypeptide of the present invention may be provided in substantially pure 
15 form. For example, it may be provided in a form which is substantially free of other 
proteins. 

In a preferred embodiment, a protein or polypeptide having an amino acid sequence as 
shown in Table 3 is provided. 

20 

The invention encompasses any protein coded for by a nucleic acid sequence as shown 
in Table 1 herein. 

As discussed herein, the proteins and polypeptides of the invention are useful as 
25 antigenic material. Such material can be "antigenic" and/or "immunogenic". 
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in 
a subject. "Immunogenic" is taken to mean that the protein or polypeptide is capable of 
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eliciting a protective immune response in a subject. Thus, in the latter case, the protein 
or polypeptide may be capable of not only generating an antibody response and in 
addition non-antibody based immune responses. 



The skilled person will appreciate that homologues or derivatives of the proteins or 
10 polypeptides of the invention will also find use in the context of the present invention, 
ie as antigenic/immunogenic material. Thus, for instance proteins or polypeptides 
which include one or more additions, deletions, substitutions or the like are 
encompassed by the present invention. In addition, it may be possible to replace one 
amino acid with another of similar "type". For instance replacing one hydrophobic 
15 amino acid with another. One can use a program such as the CLUSTAL program to 
compare amino acid sequences. This program compares amino acid sequences and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. It is 
possible to calculate amino acid identity or similarity (identity plus conservation of 
amino acid type) for an optimal alignment. A program like BLASTx will align the 
20 longest stretch of similar sequences and assign a value to the fit. It is thus possible to 
obtain a comparison where several regions of similarity are found, each having a 
different score. Both types of analysis are contemplated in the present invention. 

In the case of homologues and derivatives, the degree of identity with a protein or 
25 polypeptide as described herein is less important than that the homologue or derivative 
should retain its antigenicity or immunogenicity to streptoccocus pneumoniae. 
However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 
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Preferably, homologues or derivatives having at least 70% similarity, more preferably 
at least 80% similarity are provided. Most preferably, homologues or derivatives 
having at least 90% or even 95% similarity are provided. 

5 In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag" or 
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 

10 

In an additional aspect of the invention there are provided antigenic fragments of the 
proteins or polypeptides of the invention, or of homologues or derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or 
15 derivatives thereof, the situation is slightly different. It is well known that is possible to 
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions 
which are responsible for the protein or polypeptide's antigenicity or immunogenicity. 
Methods for carrying out such screening are well known in the art. Thus, the fragments 
of the present invention should include one or more such epitopic regions or be 
20 sufficiently similar to such regions to retain their antigenic/immunogenic properties. 
Thus, for fragments according to the present invention the degree of identity is perhaps 
irrelevant, since they may be 100% identical to a particular part of a protein or 
polypeptide, homologue or derivative as described herein. The key issue, once again, is 
that the fragment retains the antigenic/immunogenic properties. 

25 

Thus, what is important for homologues, derivatives and fragments is that they possess 
at least a degree of the antigenicity/immunogenicity of the protein or polypeptide from 
which they are derived. 
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Gene cloning techniques may be used to provide a protein of the invention in 
substantially pure form. These techniques are disclosed, for example, in J. Sambrook 
et al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). 
Thus, in a fourth aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1 . 

In a fifth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 
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(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a protein 
as defined in Table 4. 

The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 
include novel variants of those particular novel nucleic acid molecules which are 
exemplified herein. Such variants are encompassed by the present invention. These 
may occur in nature, for example because of strain variation. For example, additions, 
substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, one may wish to engineer the nucleic acid sequence by 
making use of known preferred codon usage in the particular organism being used for 
expression. Thus, synthetic or non-naturally occurring variants are also included within 
the scope of the invention. 

The term "RNA equivalent" when used above indicates that a given RNA molecule has 
a sequence which is complementary to that of a given DNA molecule (allowing for the 
fact that in RNA "IT replaces "T" in the genetic code). 

When comparing nucleic acid sequences for the purposes of determining the degree of 
homology or identity one can use programs such as BESTFIT and GAP (both from the 
Wisconsin Genetics Computer Group (GCG) software package) BESTFIT, for 
example, compares two sequences and produces an optimal alignment of the most 
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similar segments. GAP enables sequences to be aligned along their whole length and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. 
Suitably, in the context of the present invention compare when discussing identity of 
nucleic acid sequences, the comparison is made by alignment of the sequences along 
5 their whole length. 

Preferably, sequences which have substantial identity have at least 50% sequence 
identity, desirably at least 75% sequence identity and more desirably at least 90 or at 
least 95% sequence identity with said sequences. In some cases the sequence identity 
10 may be 99% or above. 

Desirably, the term "substantial identity" indicates that said sequence has a greater 
degree of identity with any of the sequences described herein than with prior art nucleic 
acid sequences. 

15 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequence coding for the gene product or for a novel part thereof. 

20 The nucleic acid molecule may be in isolated or recombinant form. It may be 
incorporated into a vector and the vector may be incorporated into a host. Such vectors 
and suitable hosts form yet further aspects of the present invention. 

Therefore, for example, by using probes based upon the nucleic acid sequences 
25 provided herein, genes in Streptococcus pneumoniae can be identified. They can then 
be excised using restriction enzymes and cloned into a vector. The vector can be 
introduced into a suitable host for expression. 
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Nucleic acid molecules of the present invention may be obtained from S.pneumoniae by 
the use of appropriate probes complementary to part of the sequences of the nucleic 
acid molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

5 

Alternatively PCR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in 
PCR so that a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. One primer will normally show a high 
10 degree of specificity for a first sequence located on one strand of a DNA molecule, and 
the other primer will normally show a high degree of specificity for a second sequence 
located on the complementary strand of the DNA sequence and being spaced from the 
complementary sequence to the first sequence. 

15 Typically primers will be at least 15-25 nucleotides long. 

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to 
provide a longer sequence. 

20 

In yet a further aspect the present invention provides an immunogenic/antigenic 
composition comprising one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 2-4, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
25 immunogenic/ antigenic composition is a vaccine or is for use in a diagnostic assay. 

In the case of vaccines suitable additional excipients, diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art. 
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It is also possible to utilise the nucleic acid sequences shown in Table 1 in the 
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. The 
5 use of such DNA vaccines is described in the art. See for instance, Donnelly et al , 
Ann. Rev. Immunol., 15:617-648 (1997). 

As already discussed herein the proteins or polypeptides described herein, their 
homologues or derivatives, and/or fragments of any of these, can be used in methods 

10 of detecting/diagnosing S. pneumoniae. Such methods can be based on the detection 
of antibodies against such proteins which may be present in a subject. Therefore the 
present invention provides a method for the detection/diagnosis of S.pneumoniae 
which comprises the step of bringing into contact a sample to be tested with at least 
one protein, or homologue, derivative or fragment thereof, as described herein. 

15 Suitably, the sample is a biological sample, such as a tissue sample or a sample of 
blood or saliva obtained from a subject to be tested. 

In an alternative approach, the proteins described herein, or homologues, derivatives 
and/or fragments thereof, can be used to raise antibodies, which in turn can be used 
20 to detect the antigens, and hence S.pneumoniae. Such antibodies form another aspect 
of the invention. Antibodies within the scope of the present invention may be 
monoclonal or polyclonal. 

Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
25 host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
described herein, or a homologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein. Well- 
known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium 
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hydroxide. The antibodies can then be purified by virtue of their binding to a protein as 
described herein. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by 
5 fusing myeloma cells and spleen cells which produce the desired antibody in order to 
form an immortal cell line. Thus the well-known Kohler & Milstein technique {Nature 
256 (1975)) or subsequent variations upon this technique can be used. 

Techniques for producing monoclonal and polyclonal antibodies that bind to a 
10 particular polypeptide/protein are now well developed in the art. They are discussed in 
standard immunology textbooks, for example in Roitt et al, Immunology second edition 
(1989), Churchill Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which 
15 are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 
1994). 

20 Antibody fragments include, for example, Fab, F(ab') 2 and Fv fragments. Fab 
fragments (These are discussed in Roitt et al [supra] ). Fv fragments can be modified 
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This 
includes a peptide linker covalently joining V h and V! regions, which contributes to the 
stability of the molecule. Other synthetic constructs that can be used include CDR 

25 peptides. These are synthetic peptides comprising antigen-binding determinants. 
Peptide mimetics may also be used. These molecules are usually conformationally 
restricted organic rings that mimic the structure of a CDR loop and that include 
antigen-interactive side chains. 
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Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present 
invention. An example of a humanised antibody is an antibody having human 
5 framework regions, but rodent hypervariable regions. Ways of producing chimaeric 
antibodies are discussed for example by Morrison et al in PNAS, 81, 6851-6855 (1984) 
and by Takeda et al in Nature. 314, 452-454 (1985). 

Synthetic constructs also include molecules comprising an additional moiety that 
10 provides the molecule with some desirable property in addition to antigen binding. For 
example the moiety may be a label (e.g. a fluorescent or radioactive label). 
Alternatively, it may be a pharmaceutically active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S. pneumoniae. 
15 Thus, in another aspect the present invention provides a method for the 
detection/diagnosis of S.pneumoniae which comprises the step of bringing into contact 
a sample to be tested and antibodies capable of binding to one or more proteins 
described herein, or to homologues, derivatives and/or fragments thereof. 

20 In addition, so-called "Affibodies" may be utilised. These are binding proteins 
selected from combinatorial libraries of an alpha-helical bacterial receptor domain 
(Nord et al , ) Thus, Small protein domains, capable of specific binding to different 
target proteins can be selected using combinatorial approaches. 

25 

It will also be clear that the nucleic acid sequences described herein may be used to 
detect/diagnose S.pneumoniae. Thus, in yet a further aspect, the present invention 
provides a method for the detection/diagnosis of S.pneumoniae which comprises the 
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step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as described herein. Suitably, the sample is a biological sample, such as a 
tissue sample or a sample of blood or saliva obtained from a subject to be tested. 
Such samples may be pre-treated before being used in the methods of the invention. 
Trhus, for example, a sample may be treated to extract DNA. Then, DNA probes 
based on the nucleic acid sequences described herein (ie usually fragments of such 
sequences) may be used to detect nucleic acid from S. pneumoniae. 

In additional aspects, the present invention provides: 

(a) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a protein or polypeptide of the invention, or a 
derivative, homologue or fragment thereof, or an immunogenic composition of the 
invention; 

(b) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a nucleic acid molecule as defined herein; 

(c) a method for the prophylaxis or treatment of S.pneumoniae infection which 
comprises the step of administering to a subject a protein or polypeptide of the 
invention, or a derivative, homologue or fragment thereof, or an immunogenic 
composition of the invention; 

(d) a method for the prophylaxis or treatment of S.pneumoniae infection which 
comprises the step of administering to a subject a nucleic acid molecule as defined 
herein; 

(e) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one 
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or more proteins or polypeptides of the invention, or homologues, derivatives or 
fragments thereof, or an antigenic composition of the invention; and 

(f) a kit for use in detecting/diagnosing S. pneumoniae infection comprising one 
5 or more nucleic acid molecules as defined herein. 

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
10 present invention also provides a method of determining whether a protein or 

polypeptide as described herein represents a potential anti-microbial target which 
comprises inactivating said protein and determining whether S.pneumoniae is still 
viable, in vitro or in vivo. 

15 A suitable method for inactivating the protein is to effect selected gene knockouts, ie 
prevent expression of the protein and determine whether this results in a lethal 
change. Suitable methods for carrying out such gene knockouts are described in Li et 
al P.N.A.S., 94:13251-13256 (1997). 

20 In a final aspect the present invention provides the use of an agent capable of 

antagonising, inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide of the invention in the manufacture of a medicament for use in 
the treatment or prophylaxis of S. pneumoniae infection. 

25 The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The examples 
refer to the figures in which: 
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Figure 1: shows the results of various DNA vaccine trials; and 
Figure 2: shows the results of further DNA vaccine trials. 

EXAMPLE 1 

5 

The Genome sequencing of Streptococcus pneumoniae type 4 is in progress at the 

Institute for Genomic Research (TIGR, Rockville, MD, USA). Up to now, the 
whole sequence has not been completed or published. On 21 st November 1997, the 

10 TIGR centre released some DNA sequences as contigs which are not accurate 
reflections of the finished sequence. These contigs can be downloaded from their 
Webster (www@tigr.org) . We downloaded these contigs and created a local database 
using the application GCGToBLAST (Wisconsin Package Version 9.1, Genetics 
Computer Group (GCG), Madison, USA). This database can be searched with the 

15 Fast A and TfastA procedures (using the method of Pearson and Lipman (PNAS USA, 
85:2444-2448 (1988)). 

Using Fast A and TfastA procedures, the local pneumococcus database was searched 
for putative leader sequence or anchor sequence features. Relevant sequences were 
20 used to interrogate for comparative novel sequences. These were: 

(i) already described leader sequences of Streptococcus pneumoniae (from 
proteins NanA, NanB, LytA, Pap A, pcpA, PsaA and PspA); 

25 (ii) the leader sequence of Usp45, a secreted protein from Lactococcus lactis; 



(iii) new hypothetical leader sequences derived from the searches in (i) and 
(ii); 
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(iv) the anchor motif LPxTG, a feature common to many Gram-positive 
bacteria surface proteins which are anchored by a mechanism involving the 
Sortase complex proteins. 

5 

Provided below is an example of this approach, with reference to the sequences 
derived from the database (see table 1). 
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Usp45 



NanA 



PapA PsaA 



PspA 



PrtP 





4121.1 4101.3 4172.4 4106.1 4346.1 4194.1 



4172.2 



4147.2 



1 



4191.6 



4137.3 4193.1 4122.1 



4170.8 4175.3 



4187.2 4188.1 4102.1 



4136.1 4127.1 4191.1 

1 i 



4119.1 



4119.2 



The protein leader sequences of different known exported proteins were used as a 
starting point for a search of the local pneumococcus database described above. The 
hypothetical proteins found with this search were then submitted to a Blast search in 
general databases such as EMBL, Swissprot etc. Proteins remaining unknown in the 
pneumococcus are kept and annotated. Then the search is performed again using the 
new potential protein leader sequence as a probe, using the TfastA procedure. 
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Example 2: DNA vaccine trials 
pcDNA3.1 + as a DNA vaccine vector 
5 pcDNA3.1 + 

The vector chosen for use as a DNA vaccine vector was pcDN A3 . 1 (Invitrogen) 
(actually pcDNA3. 1 + , the forward orientation was used in all cases but may be 
referred to as pcDNA3.1 here on). This vector has been widely and successfully 

10 employed as a host vector to test vaccine candidate genes to give protection against 
pathogens in the literature (Zhang, et al, Kurar and Splitter, Anderson et a/.). The 
vector was designed for high-level stable and non-replicative transient expression in 
mammalian cells. pcDNA3.1 contains the ColEl origin of replication which allows 
convenient high-copy number replication and growth in E. coli. This in turn allows 

15 rapid and efficient cloning and testing of many genes. The pcDNA3. 1 vector has a 
large number of cloning sites and also contains the gene encoding ampicillin 
resistance to aid in cloning selection and the human cytomegalovirus (CMV) 
immediate-early promoter/enhancer which permits efficient, high-level expression of 
the recombinant protein. The CMV promoter is a strong viral promoter in a wide 

20 range of cell types including both muscle and immune (antigen presenting) cells. 

This is important for optimal immune response as it remains unknown as to which 
cells types are most important in generating a protective response in vivo. A T7 
promoter upstream of the multiple cloning site affords efficient expression of the 
modified insert of interest and which allows in vitro transcription of a cloned gene in 

25 the sense orientation. 

Zhang, D., Yang, X., Berry, J. Shen, C, McClarty, G. and Brunham, R.C. (1997) 
"DNA vaccination with the major outer-membrane protein genes induces acquired 
immunity to Chlamydia trachomatis (mouse pneumonitis) infection". Infection and 
30 Immunity, 176, 1035-40. 

Kurar, E. and Splitter, G.A. (1997) "Nucleic acid vaccination of Brucella abortus 
ribosomal L7/L12 gene elicits immune response". Vaccine, 15, 1851-57. 

35 Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan, 
G. (1996) "Immune response in mice following immunisation with DNA encoding 
fragment C of tetanus toxin". Infection and Immunity, 64, 3168-3173. 

Preparation of DNA vaccines 

40 

Oligonucleotide primers were designed for each individual gene of interest derived 
using the LEEP system. Each gene was examined thoroughly, and where possible, 
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primers were designed such that they targeted that portion of the gene thought to 
encode only the mature portion of the gene protein. It was hoped that expressing 
those sequences that encode only the mature portion of a target gene protein, would 
facilitate its correct folding when expressed in mammalian cells. For example, in the 
5 majority of cases primers were designed such that putative N-terminal signal peptide 
sequences would not be included in the final amplification product to be cloned into 
the pcDNA3.1 expression vector. The signal peptide directs the polypeptide 
precursor to the cell membrane via the protein export pathway where it is normally 
cleaved off by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the 
10 signal peptide does not make up any part of the mature protein whether it be 

displayed on the surface of the bacteria surface or secreted. Where a N-terminal 
leader peptide sequence was not immediately obvious, primers were designed to 
target the whole of the gene sequence for cloning and ultimately, expression in 
pcDNA3.1. 

15 

Having said that, however, other additional features of proteins may also affect the 
expression and presentation of a soluble protein. DNA sequences encoding such 
features in the genes encoding the proteins of interest were excluded during the 
20 design of oligonucleotides. These features included: 

1. LPXTG cell wall anchoring motifs. 

2. LXXC ipoprotein attachment sites. 

3. Hydrophobic C-terminal domain. 

25 4. Where no N-terminal signal peptide or LXXC was present the start codon was 
excluded. 

5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop 
codon was removed. 

30 Appropriate PCR primers were designed for each gene of interest and any and all of 
the regions encoding the above features was removed from the gene when designing 
these primers. The primers were designed with the appropriate enzyme restriction 
site followed by a conserved Kozak nucleotide sequence (in all cases) GCCACC was 
used. The Kozak sequence facilitates the recognition of initiator sequences by 

35 eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of 
interest. For example the forward primer using a BamHl site the primer would 
begin GCGGGATCCGCCACCATG followed by a small section of the 5* end of the 
gene of interest. The reverse primer was designed to be compatible with the forward 
primer and with a Notl restriction site at the 5' end in all cases (this site is 

40 TTGCGGCCGC). 

PCR primers 
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The following PCR primers were designed and used to amplify the truncated genes 
of interest. 

5 

ID210 

Forward Primer 5' CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG 
3' 

10 Reverse Primer 5* TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT 
3' 

4172.5 

15 Forward Primer 5 ' 

CGCGGATCCGCC ACC ATGGATTTTCCTTCAAATTTGGAGG 3 ' 
Reverse Primer 5' TTGCGGCCGCACCGTACTGGCTGCTGACT 3' 

ID211 

20 

Forward Primer 5' 

CGGATCCGCCACCATGAGTGAGATCAAAATTATTAACGC 3 ' 
Reverse Primer 5' TTGCGGCCGCCGTTCCATGGTTGACTCCT 3' 

25 4197.4 

Forward Primer 5' CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC 
3" 

Reverse Primer 5' TTGCGGCCGCTTCACTTGAGCAAACTGAATCC 3' 

30 

4122.1 

Forward Primer 5 ' 

CGCGGATCCGCCACCATGTCACAAGAAAAAACAAAAAATGAA 3 ' 
35 Reverse Primer 5' TTGCGGCCGCATCGACGTAGTCTCCGCC 3' 

4126.7 

Forward Primer 5 ' 

40 CGCGGATCCGCCACCATGCTGGTTGGAACTTTCTACTATCAAT 3' 
Reverse Primer 5' TTGCGGCCGCAACTTTCGTCCCTTTTTGG 3' 
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4188.11 

Forward Primer 5' CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA 3' 
Reverse Primer 5" TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT 3' 

5 

ID209 

Forward Primer 5' 

CGCGGATCCGCCACCATGCTATTGATACGAA ATGCAGGG 3 ' 
10 Reverse Primer 5' TTGCGGCCGCAACATAATCTAGTAAATAAGCGTAGCC 3' 

ID215 

Forward Primer 5' CGCGGATCCGCCACCATGACGGCGACGAATTTTC 3' 
15 Reverse Primer 5' TTGCGGCCGCTTAATTCGTTTTTGAACTAGTTGCT 3' 

4170.4 

Forward Primer 5' 

20 CGCGGATCCGCCACCATGGCTGTTTTTCTTCGCTATCATG 3' 

Reverse Primer 5' TTGCGGCCGCTTTCTTCAACAAACCTTGTTCTTG 3' 

4193.1 

25 Forward Primer 5' 

CGCGGATCCGCCACCATGGGTAACCGCTCTTCTCGTAAC 3 ' 
Reverse Primer 5' TTGCGGCCGCGCTTCCATCAAGGATTTTAGC 3' 

Cloning 

30 

The insert along with the flanking features described above was amplified using PCR 
against a template of genomic DNA isolated from type 4 S. pneumoniae strain 11886 
obtained from the National Collection of Type Cultures. The PCR product was cut 
with the appropriate restriction enzymes and cloned in to the multiple cloning site of 

35 pcDNA3.1 using conventional molecular biological techniques. Suitably mapped 
clones of the genes of interested were cultured and the plasmids isolated on a large 
scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and 
maintenance of genes was confirmed by restriction mapping and sequencing ~ 700 
base pairs through the 5 ' cloning junction of each large scale preparation of each 

40 construct. 



Strain validation 
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A strain of type 4 was used in cloning and challenge methods which is the strain 
from which the S. pneumoniae genome was sequenced. A freeze dried ampoule of a 
homogeneous laboratory strain of type 4 5. pneumoniae strain NCTC 11886 was 
5 obtained from the National Collection of Type Strains. The ampoule was opened and 
the cultured re suspended with 0.5 ml of tryptic soy broth (0.5% glucose, 5% 
blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 
5% blood) and incubated statically overnight at 37 °C. This culture was streaked on 
to 5 % blood agar plates to check for contaminants and confirm viability and on to 
10 blood agar slopes and the rest of the culture was used to make 20% glycerol stocks. 
The slopes were sent to the Public Health Laboratory Service where the type 4 
serotype was confirmed. 

A glycerol stock of NCTC 11886 was streaked on a 5% blood agar plate and 
15 incubated overnight in a C02 gas jar at 37 °C. Fresh streaks were made and optochin 
sensitivity was confirmed. 

Pneumococcal challenge 

20 A standard inoculum of type 4 S. pneumoniae was prepared and frozen down by 

passaging a culture of pneumococcus lx through mice, harvesting from the blood of 
infected animals, and grown up to a predetermined viable count of around 10 9 
cfu/ml in broth before freezing down. The preparation is set out below as per the 
flow chart. 

25 

Streak pneumococcal culture and confirm identity 



30 

Grow over-night culture from 4-5 colonies on plate above 



V 

35 

Animal passage pneumococcal culture 
(i.p. injection of cardiac bleed to harvest) 



40 



Grow over-night culture from animal passaged pneumococcus 
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I 

V 

Grow day culture (to pre-determined optical density) from over-night of animal 
passage and freeze down at -70 °C - This is standard minimum 



Thaw one aliquot of standard inoculum to viable count 

i 

V 

Use standard inoculum to determine effective dose (called Virulence Testing) 

I 

V 

All subsequent challenges - use standard inoculum to effective dose 

An aliquot of standard inoculum was diluted 500x in PBS and used to inoculate the 
mice. 

Mice were lightly anaesthetised using halothane and then a dose of 1.4 x 10 5 cfu of 
pneumococcus was applied to the nose of each mouse. The uptake was facilitated by 
the normal breathing of the mouse, which was left to recover on its back. 

S. pneumoniae vaccine trials 

Vaccine trials in mice were carried out by the administration of DNA to 6 week old 
CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six 
and each group was immunised with recombinant pcDNA3.1 + plasmid DNA 
containing a specific target-gene sequence of interest. A total of 100 jug of DNA in 
Dulbecco's PBS (Sigma) was injected intramuscularly into the tibialis anterior 
muscle of both legs (50 pri in each leg). A boost was carried using the same 
procedure 4 weeks later. For comparison, control groups were included in all 
vaccine trials. These control groups were either unvaccinated animals or those 
administered with non~recombinant pcDNA3.1+ DNA (sham vaccinated) only, 
using the same time course described above. 3 weeks after the second immunisation, 
all mice groups were challenged intra-nasally with a lethal dose of 5. pneumoniae 
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serotype 4 (strain NCTC 11886). The number of bacteria administered was 
monitored by plating serial dilutions of the inoculum on 5% blood agar plates. A 
problem with intranasal immunisations is that in some mice the inoculum bubbles out 
of the nostrils, this has been noted in results table and taken account of in 
5 calculations. A less obvious problem is that a certain amount of the inoculum for 
each mouse may be swallowed. It is assumed that this amount will be the same for 
each mouse and will average out over the course of innoculations. However, the 
sample sizes that have been used are small and this problem may have significant 
effects in some experiments. All mice remaining after the challenge were killed 3 or 

10 4 days after infection. During the infection process, challenged mice were monitored 
for the development of symptoms associated with the onset of 5. pneumoniae 
induced-disease. Typical symptoms in an appropriate order included piloerection, 
an increasingly hunched posture, discharge from eyes, increased lethargy and 
reluctance to move. The latter symptoms usually coincided with the development of 

15 a moribund state at which stage the mice were culled to prevent further suffering. 

These mice were deemed to be very close to death, and the time of culling was used 
to determine a survival time for statistical analysis. Where mice were found dead, 
the survival time was taken as the last time point when the mouse was monitored 
alive. 

20 

Interpretation of Results 

A positive result was taken as any DNA sequence that was cloned and used in 
challenge experiments as described above which gave protection against that 

25 challenge. Protection was taken as those DNA sequences that gave statistically 

significant protection (to a 95% confidence level (p<0.05)) and also those which were 
marginal or close to significant using Mann-Whitney or which show some protective 
features for example there were one or more outlying mice or because the time to the 
first death was prolonged. It is acceptable to allow marginal or non-significant results 

30 to be considered as potential positives when it is considered that the clarity of some of 
the results may be clouded by the problems associated with the administration of 
intranasal infections. 
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Table 1 



4101.1 

ATGGAAGAGTTAGTGACCTTAGATTGTTTGTTTATTGACAGAACTAAGATTGAAGCCAATGCCAACAAGTATAGTT 

TTGTGTGGAAGAAAACGACAGAGAAATTCTCCGCCAAACTTCAAGAACAGATACAGGTCTATTTTCAAGAAGAAA 

TCACTCCCCnTCTGATTAAATATGCCATGTTTGATAAGAAACAAAAGAGAGGGTATAAAGAGTCAGCTAAAAACT 

TAGCGAATTGGCACTATAATGACAAGGAGGATAGCTACACACATCCTGATGGCTGGTATTATCGTTTTCACCATAC 

CAAATATCAGAAAACACAGACAGACTTTCAACAAGAAATCAAGGTTTACTACGCCGACGAACCTGAATCAGCCCC 

TCAAAAGGGACTGTATATGAACGAACGCTATCAAAACTTGAAAGCT^ 

AGGTAGACAGATTTTCGCTCAACGCAAGATTGATGTGGAACCTGTCTTTGGGCAGATAAAGGCTTCTTTGGGTT 

AAGAGATGTAATCTGAGAGGGAAGCGTCAAGTGAGAATTGACATGGGATTGGTACTTATGGCCAATAACCTCCTA 

AAATATAGTAAAATGAAATAA 

4101.3 

ATGGGGAAAGGCCATTGGAATCGGAAAAGAGTTTATAGCATTCGTAAGTTTGCTGTGGGAGCTTGCTCAGTAATG 

ATTGGGACTTGTGCAGTTTTATTAGGAGGAAATATAGCTGGAGAATCTGTAGTTTATGCGGATGAAACACTTATTA 

CTCATACTGCTGAGAAACCTAAAGAGGAAAAAATGATAGTAGAAGAAAAGGCTGATAAAGCITTGGAAACT^ 

AATATAGTTGAAAGGACAGAACAAAGTGAACCTAGTTCAACTGAGGCTATTGCATCTGAGAAGAAAGAAGATGAA 

GCCGTAACTCCAAAAGAGGAAAAAGTGTCTGCTAAACCGGAAGAAAAAGCTCCAAGGATAGAATCACAAG 

AAATCAAGAAAAACCGCTCAAGGAAGATGCTAAAGCTGTAACAAATGAAGAAGTGAATCAAATGATTGAAGACA 

GGAAAGTGGATTTTAATCAAAATTGGTACTTTAAACTCAATGCAAATTCTAAG 

ACGTATCTACGTGGAAAAAATTAGATTTACCGTATGACTGGAGTATCTTTAACGATTTCGATCATGAATCT 

ACAAAATGAAGGTGGACAGCTCAACGGTGGGGAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCT 

CAAGAAAAATGTTCGCCTTACTTTTGATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGG 

CATTATCCAAATGGTTATAACCAGTTCTCATATGATATCACCAAATACCTTCAAAAAGATGGTCGTGAGAATGTGA 

TTGCTGTCCATGCAGTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTT 

ACAAGTGACAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAACAACAACA 

TGGCAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAATA 

TCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAGCACATGA 

ATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACT^ 

TACGAATTGATTACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGATTTGTTTGGTTACCGTTACT 

ATCACTGGACTCCAAATGAAGGTTTCTCTTTGAATGGTGAACGTATTAAATTCCATGGAGTATCCTTGCACCA 

CCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCCGTCTCAAACAAATGAAGGAGATGGGAG 

TTAACTCCATCCGTACAACCCACAACCCTGCTAGTGAGCAAACCnTGCAAATCGCAGCAG AACT AGGTTTACT 

TCAGGAAGAGGCCTTTGATACGTGGTATGGTGGCAAGAAACCTTA^ 

CACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTACCATGGTCGAAAGAGGCAAAAA 

CAACCCTGCTATCnTCATGTGGTCAATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTTTAGCAACT 

GTTAAACGTTTGGTTAAGGTTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCC 

TCGGTAATGGTAGCGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCT 

ATTACAAAGCCCTTAGAGCTAAGCATCCAAAATGGTTGATTTATGGATCAGAAACATCTTCAGCTACCCGTACACG 

TGGAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATTA 

TGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCTGGCTATGC 

TGGACAGTTTATCTGGACAGGTACGGACTATATTGGTGAACCTACACCATGGCACAACCAAAATCAAACTCCTGTT 

AAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAA 

TTTCTGTTAAGAAGAAACCGATGGTACACCTTCTTCCTCACTGGAACTGGGAAAACAAAGAATTAGCATCCAAAG 

TAGCTGACTCAGAAGGTAAGATTCCAGTTCGTGCTTATTCGAATGCCT 

ATCTCTTGGTCTTAAGACTTTCAATAAAAAACAAACCAGCGATGG 

TGAACTTTATCTTGAATGGAAAGTTGCCTATCAACCAGGTACCrrrGGAAGCAATTGCTCGTGATGAATCTGGCAAG 

GAAATTGCTCGAGATAAGATTACGACTGCTGGTAAGCCAGCGGCAGTTCGTCTTATTAAGGAAGACCATGCGATT 

GCAGCAGATGGAAAAGACTTGACTTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTTCCAACTGCT 

AATAATCTGGTTCGCTTCCAATTGCATGGCCAAGGTCAACTGGTCGGTGTAGATAACGGAGAACAAGCCAGCCGT 

GAACGCTATAAGGCGCAAGCAGATGGTTCTTGGATTCGTAAAGCATTTAATGGTAAAGGTGTTGCCATTGTCAAAT 

CAACTGAACAAGCAGGGAAATTCACCCTGACTGCCCACTCTGATCT 

TGGTAAGAAAGAAGGACAAGAGAAGACTGTTTTGGGGACAGAAGTGCCAAAAGTACAGACCATTATTGGAGAGG 

CACCTGAAATGCCTACCACTGTTCCGTTTGTATACAGTGATGGTAGCCGTGCAGAACGTCCTGTAACCTGGTCTTC 

AGTAGATGTGAGCAAGCCTGGTATTGTAACGGTGAAAGGTATGGCTGACGGACGAGAAGTAGAAGCTCGTGTAGA 

AGTGATTGCTCTTAAATCAGAGCTACCAGTTGTGAAACGTATTGCTCCAAATACTGACTTGAATTCTGTAGACAAA 

TCTGTTTCCTATGTTTTGATTGATGGAAGTGTTGAAGAGTATGAAGTGGACAAGTGGGAGATTGCCGAAGAAGATA 

AAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGTTATTTAGAAGGTCAACCAATTCATGCAACCCT 

TGTGGTAGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAACTGTAACGGTTGGTGGTGAGGCAGTAACAGGTCT 

TACTAGTCAAAAACCAATGCAATACCGCACTCTTGCITATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAA 

AAATGCAGCTGTTACAGTTCTTCAAGCAAGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGA 

TGGTGGCCCTCTTCAAACCTATGCAATTCAATTCCITGAAGAAGCGC 

GAAAAAGCTGACAGTCTCAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAA 
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GCTGTATTACCAGCTGATAAAGTAACCTTCTCTACAAGTGGTC 

GAGTTGCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGAACTCACTATC 
CAAGCCAATACTGAGAAGAAGATTGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATTTGCATCAGGAACCA 
AGTCTTCCAGCAACAGTAACAGTTGAGTATGACAAAGGTTTCCCTAAAACTCATAAAGTCACTTGGCAAGCT 
5 CGAAAGAAAAACTAGACTCCTATCAAACATTTGAAGTACTAGGTAAAGTTGAAGGAATTGACCTTGA^ 

CAAAAGTCTCTGTAGAAGGTATCGTTTCAGTTGAAGAAGTCAGTGTGACAACTCCAATCGCAGAAGCACCACAAT 
TACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTAAGGTTGCATGGGATGCGATTCGTCC 
AGAGCAATACGCTAAGGAAGGTGTCTTTACAGTTAATGGTCGCTTAGAAGGTACGCAATTAACAACTAAACTTCA 
TGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATTTCTGACCAATGGACCGGTTCAGAATTGCCACnTGCC 

10 TTTGCTTCAGACTCAAATCCAAGCGACCCAGTTTCAAA 

CCAATCGTTGGACAAACTGGAATCGTACTAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGTATC^ 
GAGCAAACGCTCCGTTGATAATCTAAGTGTCGGATTCCATGAAGACCATGGAGTTGGTGTACCGAAGTCTTATGTG 
ATTGAGTATTATGTTGGTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGGACCATGTCT 
TTAATGATTCTGCCAACTGGAAACCAGTTACTAATCTAAAAGCCCCTG 

15 ACTTTAGCTTTGATAAAGTTGAAACCTATGCTGTTCGTATTCGCATGGTTAAAGCAGATAA 

TATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAAGTTGACGG 
CAAAGACrTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGGA^ 

ACAGCAAGTGTTAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTGAGCCAGTTCGTGTCATCG 
CGAAAGCTGAAAATGGCGACATCTTAGGAGAATACCGTCTGCACTTCACTAAGGATAAGAGCTTACTTTCTCATA 
20 AACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAAGCACTTGAATTGCCGACTAAGGTTCCAGT 
TTACTTCACAGGTAAAGACGGCTACGAAACAAAAGACCTGACAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCT 
GACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGTCCTTGGTAGTAACCTTGTTGCTGAGATCACTGTACGAGTG 
ACAGACAAACTTGGTGAGACTCTTTCAGATAACCCTAACTATGATGAAAACAGT 

ACCAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGAGATCATTCAGAAAATCGTCGTT 

25 GGACAAACTGGTCACCAACACCATCTTCTAATCCAGAAGTATCAGCGGGTGTGATTTTCCGTGAAAATGGTAAGA 
TTGTAGAACGGACTGTTACACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCGT 
TTTAGAACGCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCAT 
CCATTCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAAATC 
AACGTAACATTTAAAGCTATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGAGCGGTGTT 

30 GCGATGATTGAGATGACCTTCCTTGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCTTGTAGATG 
GAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAAGGTCAACGGCCAAAAGTCT 
CAGTTGAAGAAAACAATCAAGTAGCTTCAACTGTGGTAGATAGTGGAGAAGATAGCTTTCCAGTACTTGTTCGCCT 
CGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAA 
AAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAAAATCGAATTTGTTGA 

35 G C AT AC AAG AC AGTTG AG AAAAAAG ATTC AAC A CTGT ATCT AG GTG A AACTCGTGT AG AAC AAG A AG G AAAAGTT 

GG AAA AG AACGT ATCTTT AC AG CG ATT AATCCTG ATG G AAGT AAGG AAG AAAAACT C CGTG AAGTG GT AG AAGTT 
CCGACAGACCGCATCGTCnTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCACAAGTGTCAGAAAAA 
GCAGATACAAAACCAATTGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAGTTACCAAGTACAGGTAGTGCG 
GCAAGCCAAGCAGCAGTAGCAGCAGGTTTAACTCTTCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGGTAAA 

40 AAAGAAGACTAG 



4101.5 

ATGGATGCAATCITTGACCTAATCGGAAAGGTTTTCAATCCCATCITAGAAATGGGTGGACCTGTCATCATG 

45 TCATTTTGACAGTATTGGCITTACITTTrGGAGTGAAATTCT 

CGCTCTTACAGGTATCGGTGCTATCATCGGTATGCTAAACACTGCTTTCTCAGCATCACTAGCAAAATTCGT^ 
AACACrGGTATCCAATTGAGTATTACCGACGTTGGTTGGGCACCACTTGCTACAATCACITGGGGTTCTGC^ 
CACTATACHTCnTGCTCATCATGTTGATTGTCAACATAGTGATGCTAGCTATGAAGAAAACAGATACACTTGATGT 
CGATATCTTTGATATCTGGCACnTGTCTATCACAGGTCTCnTGATTAAATGGTATGCTGATAACAATGGTGTGAGT 

50 CAAGGGGTTTCACTCTTTATTGCTACAGCAGCTATCGTCCT^^ 

GAAACCTACATTTGATGACCTTCITAACGCCCCAAGTTCATCACCAATGACATCAACTCACATGAACTACATGATG 
AACCCAGTTATCATGGTTTTGGATAAGATTTTTGAAAAATTCTTCCCAGGCCTTGATAA^ 
CTAAATTGAACAAGAAAATCGGTTTCTGGGGATCTAAATTCITCATCGGTTTCATCCTTGGTATCG 
ATGGGAACTCCACATCCAATTGCAGGTGTTGCAGATGCAGATAAATGGCGTCTTGTTATCAAAGGATGGTTGTCTC 

55 TTGGTTTGACTGCCGGTGTATCITTGGAACTCrTCTCACTTATCGGTTCATGGTTCATCGCAGCCG 

TCACAAGGTATTACAAACGTTGCTACTAAACGTCnTCAAGGACGTAAATTCAATATCGGTCITGACT 
TCGCTGGTCGTGCTGAAATCTGGGCTTGTGCCAACGTACTTGCACCAATCATGTTC 
AAAAGTTGGAAATGGTATCnTGCCACTTGCAGGTATCATCGCTATGGGTGTTAC 
CGTGGTAAATTGCTCCGTATGATTATCTTCGGAACACTCTTGTTGCCACTCTT^ 

60 ACCATITGCAACAGAACTTGCTAAAGGTGTAGGTGCCnTCCCAGAAGGTGTGAGCCAAACTCAATTGATTACTCAC 
TCTACTCrTGAAGGACCAATCGAAAAACTTCTTGGTrGGACAATTGGTAACACTACAACrGGTGATATC 
TCCITGGTGCAGTAGTCTTCCITGTATTCTATATCGGTATCnTTGCTTGGTACAGAAAACAAATGATCAAACGTAA 
CGAAGAGTACGCAGCAAAAGCAAAATAA 

65 4102.1 
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ATGAAGATTATGAAAAAAAAATATTGGACTTTAGCGATATTATTCTTTTGTTTGTTCAATA 
AGAAATACCTAAAAATCTTGATGGCAATATAACTCACACTCAGACTAGCGAAAGTTTTTCT 
ACAGGTTGACTATTCTAATAAAAATCAAGAAGAAGTAGACCAAAATAAATTTCGTATTCAAATCG 
ATTATTTGTAACAACAGATAAACATTTAGAAAAAAACTGTTGTAAATTGGAACTTGAA 

TATTGTTAACTCTGAAAGTAATAATTTACTAGGCGAAGATAATTTAGATAATAAAATTAAGGAAAATGTTTCT 

CTAGATAATAGAGGAGGAAATATAGAGCATGACAAAGATAACTTAGAATCGTCGATTGTAAGAAAATATGAATGG 

GATATAGATAAAGTTACTGGTGGAGGCGAAAGTTATAAATTATATT 

TTTTAGATTCAGGAGTCGATTTACAAAATACTGGATTACTGAAAAATCTTTCAAATCACT 

CAATAAAGGATATTTAGGAAAAGAGGAGGGAGAGGAAGGAATAATATCAGATATTCAAGATAGATTAGGTCATG 

GTACGGCTGTTGTAGCTCAAATTGTAGGGGATGACAATATTAATGGAGTAAATCCTCACGTTAATATTAACGTCT 

TAGAATATTTGGTAAGTCGTCAGCTAGTCCAGATTGGATTGTAAAAGCAATTTTTGATGCTGTAGATGATGGCAAT 

GATATTATCAATCTTAGTACTGGACAATATTTAATGATTGATGGAGAATATGAGGACGGAACAAATGATTTTGAAA 

CATTTTTGAAGTATAAAAAGGCTATTGATTACGCGAATCAAAAAGGAGTAATTATAGTAGCTGCATTA 

ACTCCCTAAATGTATCAAATCAGTCAGATTTATTGAAACTTAT^^ 

AGTAGTTGATGTTCCAAGTTATTTCTCATCTACAATTTCGGTCGGAGGCATAGATCGCTTAG 

GTAATTTATCAGATTTTAGCAATAAAGGGGATTCTGATGCAATATATGCGCCTGCAGGCTCAACATTATCT 

AGAATTAGGACTTAATAACTTTATTAATGCAGAAAAATATAAAG^ 

ATATACGTATCITTATGGAAACTCATTTGCTGCTCCTAAAGTTTCTGGTGCGATTGCAA 

AATTAAAAGATCAGCCCTATAATTATATGTTTGTAAAAAAATTCTGGAAGAAACATTACCAGTAA 

4106.1 

ATGAAGAAAACATGGAAAGTGTTTTTAACGCTTGTAACAGCTCTTGTAGCTGTTGTGCTTGTGGCCT 

GAACTGCirCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACTG 

ACCACACAGGGCTTTATGTTGCCAAGGAAAAAGGTTATTTCAAAGAAGCTGGAGTGGATGTTGATTTG 

CACCAGAAGAAAGTTCTTCTGACTTGGTTATCAACGGAAAGGCACCATTTGCAGTGTATTTCC 

TAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATTGTTGAACACAATACATCAGGAATCATCTC 

TCGTAAATCTGATAATGTAAGCAGTCCAAAAGACrTGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA 

ACTTGCTATGTTGAAAACCTTGGTAGAATCTCAAGGTGGAGACTTTGAGAAGGTTGAAAAAG 

CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACT^ 

GCTAAATCTCAAGGTGTAGATGCTAACTTCATGTACTTGA 

TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAAG 
GCTACCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACTCAAGGAAA 
AACGTGACITTGTCATCGAATCTCAAAAATACTTC 

TTGACGCAGCTCGCTGGAATGCTTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAA 
AAGGCTTCACCAACGAATTTGTGAAATAA 

4106.4 

ATGATAAAAAATCCTAAATTATTAACCAAGTCTTTTTTAAGAAGTTTTGCAATTCT 
TTCATATAGCTATTTATTTGACCITTCCITTTTATTATATTCAACT 

AGTGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACTCCAGTCTTATTCAAAGT 
ACCATATCTGCTCACCTTAAAAGAGATATTGTAGATAAGCGGCT^^ 

GAAAGCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATT 

TGTTCACGGGGTGGATGTCTACAAAGAAGCAAAGAATATTTTGCTTTTGTATCTCCCAT 

ATTGCTTTTTCCTTTGTTTTTTCTTAT 

AGTAAAATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTGGAAAA 

CAGATTAATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAAAT 

TGCAAAATCAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCTTTAGCCA 

CCTAGAGAATATGCAGCATAATATTGGAGATTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGAT 

TGACCAGATGAGCCACITATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGAC 

CTTGACTGTTAAGCCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATTCAATAG 

AAAATCAATTGACAGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTA 

GTAATGCAATTAAATATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAA 

ATACATGTGCGCCTCTAAGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGA 

TAAGGATGAAAGTTCCGGTTTGGGTCTTTACATTGTGAATA 

CTCCCTTATGAACACGGTATGGAATTTAAGATTAGCTTGTAG 

4106.6 

ATGTATTTAGGAGATTTGATGGAGAAAGCCGAGTGTGGTCAATTTTCAATACTTTCCTTTCTATTAC 

AGACGACCGTCAAGGCTGTAATGGAAGAAACAGGATTTTCAAAAGCAACCCTAACCAAATATGTCACCCTGCTCA 

ATGACAAGGCTTTGGATAGTGGCTTAGAGCTGGCTATTCACTCAGAAGATGAAAATCTGCGTCTGTCTATCGGTGC 

AGCTACCAAGGGGAGAGATATTCGGAGCTTGTTTTTGGAGAGTGCTGTTAAATACCAGATTTTGGTT^ 

TACCACCAACAGTTTTTAGCCCATCAGCTGGCTCAAGAATTGGTGATTAGCGAGGCTACGCTTGGTCGTCACTTGG 

CTGGTTTAAATCAGATTTTGTCAGAATTTGATTTATCCATCCAAAATGGCCGTTGGCGAGGTCCAGAGCATCAGAT 

TCACTATTTCTATTTCTGTCTTTTCCGAAAGGTCTGGTCGAGTCAGGAATGGGAAGGTCACATGCAGAAACCAGAG 

AGAAAACAGGAGATTGCCAATTTAGAGGAAATCTGCGGTGCAAGTTTGTCTGCGGGGCAGAAATTGGACTTGGTT 



SUBSTITUTE SHEET (RULE 2B) 



WO 00/06737 



32 



PCT/GB99/02451 



CTCTGGGCTCACATCAGTCAACAACGTCTTCGGGTCAATGCTTGTCAGTTTCAAGTCATAGAAGAGAAAATGCGA 
GGGTATTTTGACAATATCTTTTATCTTCGTT^^ 

AGGAGTTGAGGATGGTGAGATGATGATATTCTTCTCTTTTCTCCTATCTCA^^ 

AGTATATTCTTGGTTTTGGAGGGCAGTTGGCAGATTTACTGACGCAATTGATTCAAGAAATGAAGAAGGAG 
5 TATTGGGGGATTATACAGAGGACCATGTCACCTATGAACTCAGTCAGCTTTGTGCTCAAGTCTATCTCTATAAGGG 
CTATATTTTACAGGATCGCTACAAGTACCAGTTAGAGAATCGTCATCC 
GAGACAGCAGAGGAGATTTTTCATGCTCTACCTGCTTTTCAACAGGGGACAGATTTA 

AATGGCTCCAGTTAATCGAATATATGGCTGAAAACGGTGGCCAGCATATGCGGATTGGTCTGGATTTGACATCTGG 
TTTTCTTGTCTTTTCAAGGATGGCAGCCATTTTGAAACGGTATTTG^ 
10 ATGACCCTAGTCGGCATTATGATTTGCTGGTTACCAATAACCCGATTCATAAGAAGGAACAGACACCAGTCTATTA 
TTTAAAAAATGACTTGGATATGGAGGATTTGGTAGCGATTCGCCAGTTATTATTCACITAA 

4106.7 

ATGGAATTTTCAAAGAAAACACGTGAATTGTCAATTAAAAAAATGCAGGAACGTACCCTGGACCT 

15 GGTGGAGGAATCACAGGAGCTGGTGTAGCCTTGCAGGCGGCAGCTAGCGGTCITGAGACTGGTTTGATTGAAATG 
CAAGACTTTGCAGAAGGAACATCTAGTCGTTCAACAAAATTGGTTCACGGAGGACTTCGTTACCTCAA^ 
ACGTAGAAGTGGTCTCAGATACGGTTTCTGAACGTGCAGTGGTTCAACAAATCGCTCCACACATTCCAAAATCAG 
ATCCAATGCTCTTACCAGTTTACGATGAAGATGGAGCAACCTTTAGCCrCTTCCGTCTTAAAG 
GTACGACCTCTTGGCAGGTGTTAGCAACACACCAGCTGCGAACAAGGTTTTGAGCAAGGATCAAGTCTTGGAACG 

20 CCAGCCAAACTTGAAGAAGGAAGGCTTGGTAGGAGGTGGAGTGTATCTTGACTTCCGTAACAACGATGCGCGTCT 
CGTGATTGAAAACATCAAACGTGCCAACCAAGACGGTGCCCTCATTGCCAACCACGTGAAGGCAGAAGGCTTCCT 
CTTTGACGAAAGTGGCAAGATTACAGGTGTTGTAGCTCGTGATCTCTTGACAGACCAAGTGTTTGAAATCAAGGCC 
CGTCTGGTTATTAATACAACAGGTCCTTGGAGTGATAAAGTACGTAATTTGTCTAATAAGGGAACGCAATTCTCAC 
AAATGCGCCCAACTAAGGGAGTTCACTTGGTAGTAGATTCAAGCAAAA 

25 ACACAGGTTTGGGTGACGGTCGTATGGTCTTTGTTCTCCCACGTGAAAACAAGACTTACTTTGGTACAACTGATAC 
AGACTACACAGGTGATTTGGAGCATCCAAAAGTAACTCAAGAAGATGTAGATTATCTACTTGGCATTGTCAACAA 

CCGCTTCCCAGAATCCAACATCACCATTGATGATATCGAAAGCAGCTGGGCAGGTCTTCG 

TCCATTGATTGCAGGGAACAGTGCCTCTGACTATAATGGTGGAAATAACGGTACCATCAGTGATGAAAGCTTTGA 
CAACTTGATTGCGACTGTTGAATCITATCTCTCCAAAGAAAAAACACGTGAAGATGTTGAGTCTGCT 

30 CTTGAAAGTAGCACATCTGAGAAACATTTGGATCCATCTGCAGTTTCTC^ 

ATGGTCTCTTGACTCTTGCTGGTGGTAAAATCACAGACTACCGTAAGATGGCTGAAGGAGCTATGGAGCGCGTGG 
TTGACATCCTCAAAGCAGAATTTGACCGTAGCTTTAAATTC 

ATTGAACCCAGCAAATGTGGATTCAGAAATCGAAGCCTTTGCGCAACTTGGAGTATCACGTGGTTTGGATAGCAA 
GGAAGCTCACTATCTGGCAAATCTTTACGGTTCAAATGCACCGAAAGTCTTTGCACTTGCTCACAGCT^ 
35 GCGCCAGGACTCAGCTTGGCAGATACTTTGTCCCITCACTATGCAATGCG 
ACTTCCTTCTTCGTCGTACCAATCACATGCTCHTTATGCGTC 

GAAATGGGACGATTCTATGACTGGACAGAAGAAGAAAAAGCAACTTACCGTGCTGATGTCGAAGCAGCTCTCGCT 
AACAACGATTTAGCAGAATTAAAAAATTAA 

40 4106.8 

ATGATGAATGAATTATTTGGAGAATTTCTAGGGACTTTAA 

TGGTTCTTCCTAAAACCAAGAGCAATAGCTCAGGTTGGATTGTGATTACTATGGGTTGGGGGATTGCAGTTGCGGT 
TGCAGTCTTTGTATCTGGCAAGCTCAGTCCAGCTTATTTAAACCCAGCTGTGACCATCGGTGTGGCCTTAAAAGGT 
GGTTTGCCTTGGGCTTCCGTTTTGCCTTATATCTTAGCCCAGTTCGCAGGGGCCATGCTGGGTCAGATTTTG 

45 GTTGCAATTCAAACCTCACTATGAGGCAGAAGAAAATGCAGGCAATATCCTGGCAACCTTCAGTACTGGACCAGC 
CATCAAGGATACTGTATCAAACTTGATTAGCGAAATCCTTGGAACTTTTGTTTTGGTGTTGACAATC 
GTCTTTACGACITTCAGGCAGGTATCGGAACCrTTGCAGTGGGAACriTTGATTGTCGGTATCGGTCT 
TGGGACAACAGGTTATGCCTTGAACCCAGCTCGTGACCTTGGACCTCGTATCATGCACAGCATCTTGCCAATTCCA 
AACAAGGGAGACGGAGACTGGTCTTACGCTTGGATTCCTGTTGTAGGCCCTGTTATCGGAGCAGCCTTGGCAGTG 

50 CTTGTATTCTCACTTTTCTAG 

4106.10 

ATGAAAAGGACCTGGAGGAACTCATTCGTGACAAATCTTAATACACCTTTTATGATTGGCAATATTGAGATTCC 
ATCGTACCGTTTTAGCGCCTATGGCTGGCGTGACCAACTCAGCCTTTCGTACTATCGCAAAGGAGCTCGGAGCTGG 

55 ACTCGTTGTAATGGAAATGGTCTCTGACAAGGGAATCCAATACAACAACGAAAAAACCCTGCACATGCTTCATAT 
CGATGAGGGCGAAAACCCTGTCTCTATCCAACTTTTTGGTAGCGATGAAGACAGCCTAGCACGCGCAGCAGAATT 
CATCCAAGAAAACACCAAGACCGATATCGTCGATATCAACATGGGCTGCCCTGTCAACAAAATCGTGAAGAACGA 
AGCTGGTGCTATGTGGCTCAAGGATCCAGACAAGATTTACTCCATCATCAACAAGGTCCAGTCTGTCCTTGATATC 
CCACTTACTGTCAAAATGCGTACCGGCTGGGCGGACCCATCTCHTGCAGTAGAAAATGCTCTCGCTGCTGAAGCTG 

60 CAGGTGTTTCTGCCCTCGCCATGCATGGCCGTACCCGTGAACAAATGTATACTGGCCACGCAGACCTTGAGACCCT 
TTACAAGGTTGCCCAAGCTCTAACCAAGATTCCATTCATCGCCAACGGTGATATCCGTACTGTCCAAGAAGCCAA 
GCAACGCATCGAAGAAGTTGGTGCTGACGCAGTCATGATTGGCCGAGCTGCCATGGGAAATCCTTACCTCTTCAA 
CCAAATCAACCATTACTTTGAAACAGGAGAAATCCTACCTGATTTGACCTTTGAAGACAAGATGAAGATCGCCTA 
CGAACACTTGAAACGATTGATTAACCTCAAAGGAGAAAACGTCGCAGTTCGTGAATTCCGCGGTCTCGCTCCTCA 
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CTATCTCCGTGGAACATCTGGCGCTGCCAAACTCCGTGGAGCCATTTCGCAAGCCAGCACCCTGGCAGAGATTGA 
AACCCTCTTGCAATTGGAGAAGGCTTAA 



4107.1 

ATGACAAAGAAGAAAATTGAGCGTATTTCTGTAATACACCGAGAAAAGATTTTATGGCTCAAGTGGTATTT 

GAGATAAAGAACAACCTAAGTATAGTGTCCTTGAGCGTAAAATGTTTGATGCTGCTAAAAATCAAGATATGCTAG 

CTTATCAAAAATACGCAACTATCAAGCAGATAACAGATATTAGGGTACAAACAAGTGAGGCTGACATTTTAGAGG 

CTGTAAAAGAGGTTTATGTGTACAATCACATGAATGTTATCGGAGCTTGTCAGCGGATATTATTTATCAGTCAATC 

ACCAGCITATGATAAGTTAAATAAGTGGTTTAATATCTATTCTGATTTGTATTTT 

TGGGGGTATATCATGAGATGGTAGGTATCTAG 

4107.2 

ATGAAAAATTCCAACGAGGCTGAGATGAAATTACTTTATACTGATATTCGGACTTCITTGACAGA 

GAGAGGCAGAAGAGCTAGTTGCAGCTGGCAAGCGGGTCTTCTACATTGCCCCCAACTCTC 

ACGCGCCGTGCTGGAATACITGTCCCAGCAGGCTTCTTTTTCGATT^ 

TATCTGGTCTTGAATGATTTACCAGCTAAAACTACTCTTGATGATATCGGTCTTGGGTTGGCCTTTTAC 

TGCCGAACTCGATCCCAAGGACTTGCGTGTTTATGGCGCTATTAAGCAGGATCCTCAATTGATCCAGCAGTTAATT 

GAGCTTTACCATGAGATGACCAAATCTCAGATGAGTTTTTTGGACT^ 

GCGGATTTACTCrTGATTTTTGAGAAAGTAACAGCCTATCTTAATCAAGGTCAGTTAGCCCAGGAAA 

CCCATTTGATTGAGGCTATTGAGAATGACAAGGTAAGTAGTGATTTTAATCAAATCGCCTTGGTCATTGACGGC^ 

TACTCGTTTTTCTGCTGAGGAAGAGCGGGTTGTGGACTTACTTCACGGCAAAGGTGTTGAGATTGTTATC 

TATGCTAGTAAGAAAGCCTATACCAGTCCTTTTAGCGAGGGCAATCTCTACCAAGCCAGCGTAAAATTTCTCCATC 

ATCTGGCTTCTAAATACCAAACGCCTGCTCAGGACTGTTCrrCAAACTCATGAGAAGATGGATAGTTT^ 

CTCTCGTTTGTTGGAGTCITCTTATGACTTTTCA 

CAAATCTGGTCTTGTTTGACGCAAAAGGAGGAGTTGGAGCTAGTAGCCCGTAGTATTCGTCAGAAATTACATGAG 
AACTCAGACCTGAGCTACAAGCATTTTCGTATTCTCTTGGGGGATGTAGCTTCTTACCAGTTATCT 
TTTTTGACCAGTATCAGATTCCTTTTTATCTTGGTAGAAGCGAAGCCATGGCTCATCATCCCTTGACTCA 
GAGTCTATTTTAGCTTTAAAACGTTACCGTTTTCGTCAGGAGGATTTGATTAATC^ 

TGACCTCAGTCAGTCTGATATTGATGCTTTTGAGCAATATATCCGCTATCTTGGTATCAATGGCTTGCC AGCCT TTC 

AGCAAACCITCACCAAATCCCACCATGGAAAATTTAATCTTGAGCGTTTGAATGTCCTCCGCCTC 

ACCTCTTGAAACCCTCTTTGCCAGCCGAAAACAAAAGGCTGAAAAACTCCTACAAAAAT 

AGAAGGAGCTGTGACCAAGCAGTTACAAGATTTGACAACCACTTTGGAAGCTGTAGAACAGGAAAGACAAGCCG 
AAGTTTGGAAGGCTTTCTGCCATGTTTTAGAACAATTTGCGACT^ 

CTTCCTAGCCTTGCTCCATTCTGGAATGAGTTTGTCCCAATACCGTACCATTCCAGCAACAGTGGACACTGTTCTG 
GTGCAGAGTTACGATTTGATTGCACCATTGACTGCTGACTTTGTCTATC 

CAAAAATTTCTCAAAACACCAGTCTTCTGACAGATGAAGAAAGGCAAAACCTAAACCAAGCGACCGAAGAAGGC 

GTTCAATTACTGATTGCCAGCAGTGAAAATCTCAAGAAAAATCGCTACACTATGCTTTCCITGG 

GTAAGCAGTTGTTCTTGTCGGCTCCAAGCCTTTTTAACGAAAGTGAAAGTAAGGAATCTGCCT 

GATCCATTTTGGATTTAGGCGGAGAGAGAAGAGGATGAATCACAAAGGACTGTCTAAGGAGGATATGGGGTCCTA 

TCACAGTCTTTTGTCTAGTCTGGTTGCCTATCACCAGCAGGGTGAGATGAGCGATACTGAGCAAGATTTGACTTTT 

GTCAAGGTTCTGTCGCGTGTCATAGGTAAAAAACTAGATCAGCAAGGTCTGGAAAATCCAGCTATCCCAACCAGT 

CCAAGCAGCAAGACCTTAGCCAAGGACACCTTGCAAGCTCT 

CTATCCTGCCAAACAGGAGTTTTACCTGTCTACGTCGGGTTTGACAGAGTTTTATCGCAATGAATACAGTT^ 

CTACGCTACGTTTTAGGCTTGCAGGAGGAATTACGTTTGCATCCTGATC 

GTATCTTTGAACGCGCCITACAGTTGCCTAATGAAGATTC 

CAGTCAAGAACGCGAATTTGAAGCTATTTATCAAGAAAGTTTGGAAGCCCAGTTT^^ 

GTTGCACGGACAACTGGACATATTCTCCGACACAATCCAGCCATCGAAACCATCAAAGAAGAAGCAAATTTTGGT 
GGAAAAGACCAAGCCTTTATTCAATTAGACAATGGACGCAGTGTCTTTGTACGAGGCAAGGTGGACCGGATTGAC 
CGTTTGAAAGCTAATGGAGCGATAGGAGTAGTAGACTACAAATCCAGTCTGACTCAGTTCCAGTTTCCTCATTTCT 
TTAATGGGCTCAATT(HX:AGTTACCAACCTATCTTC 

CATGTACTTGGAAATGGCTGAACCTGTCCAATCTCTGATGGCGGTAAAAAGTCTGGCAGGAGCAGTGGTAGAAGC 
CAGCAAATCTATGAAATACCAAGGGCTCTTCTTGGAAAAAGAAAGCAGTTATT^ 

CAAGGCTAATCAACTGACAGATGAGGAATTTCAGCTCCTACTGGACTACAATGCCTATCTTTACAAGAAAGCTGCT 

GAGAAGATTTTAGCAGGCCGGTTCGCCATCAATCCTTATACTGAAAATGGCAGAAGCATTGCCCCATACGTCCAG 

CAACATCAGGCTATTACAGGCTTTGAAGCCAATTACCATCTGGGCCAAGCCCGTTTCCTAGAAAAGTTGGAC 

CTGATGGCAAGCGTCTGGTCGGAGAAAAACTCAAGCAAGCTTGGCTTGAAAAAATAA 

GAGAGGAGTTGAATCGATGA 

4107.3 

ATGAAGCTTATTCCCITTTTAAGTGAGGAGGAGATTCAAAAACTGCAAGAAGCAGAAGCAAATTCGAGCAAGGAA 

CAGAAGAAAACTGCCGAGCAAATCGAAGCTATCTACACTTCTGCCCAGAATATCCTGGTCTCAGCATCGGCTGGT 

TCTGGAAAGACCTTTGTCATGGCAGAGCGCATTCTGGACCAATTGGCGCGTGGTGTCGAAATTTCTCAACTCTTTA 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



34 



PCT/GB99/02451 



TCTCAACCITTACCGTCAAGGCTGCAA 

AAACAGATGATGTCGACCTCAAACAACACTTGGGTCGCCAGTTGGCAGACCTACCCAACGCTGCCATTGGAACCA 

TGGATTCTTTCACACAAAAATTCCTTGGCAAACATGGTTATCTGC^ 

AACCAAAGCGAGCAACTTATTCTCGAAAACGAAGTCTTrCATGAGGTCTTTGAAGCG 

AAAGAGACCTTTAGTCATTTGCTGAAAAACTTTGCTGGGCGTGGCAAGGACGAACGG G 

TATAAAATCTATGACTTCCTCCAATCCACCAGTAATCCTCAAAAGTGGCTGAGTGAATCTTTCCTC 

AGAAAGCTGATTTTACCAGTGAAAAAGAAAAACTGACCGAGCAAATCAAACAAGCCCTTTGGG 

TTTTCCGTTACCATCTGGATAACGATGCCAAGGAGTTTGCAAAGGCTGCCTATTTAGAAAATGTTC^ 

GGATGAAATTGGCTCCCTAAATCAGGAGTCCGATAGTCAGGCTTATCAGGCAGTGCTTGCGCGTGTTGTCGCCATC 

TCTAAGGAGAAAAACGGTCGAGCTCTGACTAATGCCAGCCGTAAGGCTGATTTGAAGCCCCTGGCTGATGCCTAC 

AACGAAGAGAGAAAGACCCAGTTTGCTAAACTAGGACAATTATCAGACCAGATAGCGAT 

TCTCGACTATCAAGAACGTTATCATGGAGACACTTGGAAACTAGCTAAAACCTTCCAATCTTTCATGA 

GTAGAGGCTTATCGTCAGAGAAAACGACAGGAAAATGCCTTCGAATTCGCTGATATCAGCCATTACACCATTGAG 

ATTTTAGAGAATTTCCCACAAGTTCGTGAGTCTTATCAGGAGCGCTTCCATGAAGTCATGGTCGATGAGTATCAGG 

ATACCAACCATATTCAAGAACGGATGCTGGAATTGTTGTCTAATGGCCACAATCGCTTTATGGTGGGAGATATCAA 

GCAATCCATCTATCGTTTCCGTCAGGCAGACCCGCAGATTTTCAATGAGAAATTCCAACGCTATGCGCAAAATCCC 

CAAGAAGGCAGGCTCATTATCCTCAAGGAAAATTTCCGTAGTAGTTCAGAAGTGCTGTCAGCAACCAATGATGTC 

TTTGAACGTCTCATGGACCAAGAGGTCGGCGAAATCAACTATGATAACAAGCACCAGCITGTTTTTGCCAATA 

AACTGACTCCCAATCCAGACAACAAGGCAGCATTTCTCCTCTACGACAAGGACGATACAGGTGAGGAAGAAGAGA 

GTCAAACAGAAACGAAACTAACAGGCGAAATGCGCTTAGTTATCAAGGAGATTCTGAAACITCATCAAGAAAA 

GTGTTGCCnTTAAGGAAATTGCCCTTCTGACCTCCAGCCGCAGTCGTAATGACCAGATTCTCCTCGCCCTGTCTGA 

GTACGGAATTCCTGTCAAAACTGACGGAGAGCAAAACAATTATCTCCAATCCCTAGAAGTGCAAGTCATG CTAGA 

CACTCTTCGTGTCATTCACAATCCCCTGCAAGACTACGCCTTGGTTGCCCTTATGAAGTCTCCAATGTTTGGTT^ 

ATGAGGATGAGCTAGCACGTTTGTCCCTTCAGAAAGCAGAGGATAAAGTCCACGAAAATCTCTATGAGAAACTGG 

TCAATGCACAAAAAATGGCAAGTAGTCAAAAAGGCTTGATTCACACAGCTCTAGCTG 

AAAAACTAAAGCAATTCATGGATATCCTAGCTTCITGGCGCT 

TTGGAAGATTTACAACGACCGTTTTTATTATGACTATGTTGGGGCTTTGCCGAATGGTCCT 

CTCTATGCCCTAGCACTGCGTGCTGATCAATTTGAAAAGAGCAATTTCAAAGGTTTGTCGCGTT^ 

TTGACCAAGTCTTAGAAGCCCAGCACGATTTGGCAAGCGTGGCCGTCGCACCGCCAAAAGATGCAGTAGAGCTCA 

TGACCATCCACAAGAGTAAAGGGCTGGAGTTTCCTTACGTCTTTATCCTCAATATGGATCAAGATTTCAACAAGCA 

AGACTCTATGTCAGAAGTCATTCTCAGTCGTCAGAATGGTCTTGGTGTCAAATATATTGCCAAGATGGAGACAGGG 

GCAGTAGAAGACCACTATCCTAAAACCATCAAACTCTCCATTCCTAGTCTGACCTATAGGCAGAACGAAGAGGAA 

TTACAGCTAGCAAGCTATTCTGAGCAGATGCGTTTGCTGTATGTTGCTATGACGCGGGCTGAGAAAAAGCTCTATC 

TTGTCGGCAAGGGTTCTCGTGAAAAGCTGGAATCCAAGGAATACCCAGCAGCCAAAAATGGGAAACTAAATAGCA 

AT ACT AG ACTG C AAG C ACG G AATTTCC AAG ATTGG CTTTGG G CT ATC AGT AAAGTGTTT A CT AAGG AC AAG CTC A 

ACTTTAGTTATCGTTTTATTGGCGAAGATCAGTTGACCAGAGAAGCTATCGGAGAGTTGGAAACCAAGAGTC 

CCAAGATAGCTCCCAAGCAGACAATCGTCAGTCAGATACCATCAAAGAAGCTCTGGAAATGCTGAAGGAGGTGGA 

AGTTTATAATACTCTTCACCGCGCAGCTATTGAACITCCTAGTGTTCAAACCCCAAGT 

GAACCAGTTATGGATATGGAAGGTGTCGAGATTGCTGGTCAAGGTCAGTCAGTAGGCAAGAAAATCAGCTTCGAT 

TTGCCAGATTTTTCAACCAAAGAAAAGGTAACTGGAGCTGAGATTGGTAGTGCTACTCACGAACT 

ATTGACCTCAGCCAGCAACTAACCCTTGCTAGCCTAACAGAAACACTCAAACAAGTTCAAACTAGCCAAGCT 

AGAGACAAGATCAATCITGATAAAATTCTTGCITTCITTGACACAGTACTCGGT 

ACCATCTTTATCGCGAGCAACCTTTCTCCATGCTCAAACGAGACCAAA^ 

TATCCirGATGGCTATCTGCITTACGAAAACAAAATTGTTCTGTTCGACT 

AGTCAACTCGTAGACCGCTATCGTGGTCAGTTAGCTCTATACGAAGAGGCTTTATCACGAGCCTATTCGATTGAAA 
ATATTGAAAAATACnTGATTTTACTCGGTAAAGACGAGGTTCAAGTTGTAAAAGTATAA 

4109.1 

ATGGAACTTGCTCGCCATGCTGAAAGCrTGGGAGTAGATGCTATTGCAACGATTCCACCAATTTATT^ 
CAGAATACTCAGTTGCCAAATACTGGAACGATATCAGTTCTGCAGCTCCAAACACAGACTACGTGATTTACAACA 
TTCCTCAATTGGCAGGGGTTGCITTGACTCCAAGCCITTACACAGAA^ 
GAAGAACTCTTCTATGCCAGTTCAAGATATCCAAACCITTGTCAGCCriTGGTGGAGAA^ 

GGTCCTGATGAGCAGTTCCrAGGAGGACGCCTCATGGGGGCTAGGGCrGGTATCGGTGGTACITATGGTGCrATGC 
CAGAACTCTTCrTGAAACTCAATCAGTTGATTGCG 

CAACGCAATCATTGGTAAACTCACITCTGCrCATGGAAATATGTACGGTGTCATCAAAGAAGTCrrGAA 

GAAGGCTTGAATATTGGATCTGTTCGTTCACCATTGACACCAGTGACTGAAGAAGATCGTCCAGTTGTAGAAGCG 

GCTGCTGCCTTGATTCGTGAAACCAAGGAGCGCTTCCTCTAA 

4110.2 

ATGTATAAGACAAAGTGTTTACGAGAGAAGTTAGTATTATTTTTAAAAATTTTCrr 

TTGCCAATTATTCTGCCTCTTTTGTTGATACTGCAATGACAGGTCAATACAACACT 

ATGGCAACCAGTATCTGGAATCCITTCTTTACATTTCTAACAGGGATTGTGTC^ 

CCATCTrGGTCGAGGCAAAAAGGAAGAAGTTGCGTCrGATTTTTACCAATTTATTTATT^ 

GTGGTCITGCTGGGGATGGTACnTTTCTTGGCACCAATAATCTTGAATCATATTGGGTTAG 

CAGTAGCGGTTCGCTATCTTTGGTTTTTATCTATCGGGATTATCCCCTTGTTGCTCTTTAGCGTCATr 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



PCT/GB99/02451 



35 



TGGATTCGCTGGGCITGACCAAACTGTCCATGTACCTCATC 
CTCTTGATTTACGGTGCCTTTGGTGTTCCAGAACTGGGAGGGGCT 
GGGTCTTGCTTGGGATTTCTGTTCTGGTTTTATTTAAACAGGAGAAGCTCAAA 
TCCACTTAATATGGATAAAATTAAGGAAGGAGTTCGTTTAGGTCTGCCTATTGGGGGAACTGTCT^ 
5 GCTATCTTTTCAGTGGTTGGCnTGATTATGGCTAAGTTTTCGCCCTTGATTATAGCT 
CTTTTCAAGTCnTATGTACGCCTTTCCTATGAGTATCT 

CCAAGCGATTTGATGATGCGAAAACCTATATTGGTCTAGGAAGATGGACTGCCCTCATTTTTGCGGC 
AACCTTCCTTTACATTTTTAGGGGAAATGTGGCCAGTCTTT^^ 
CGTTTTTTAACTTATAGTCTTTTCTTCCAGTTAGCAGATA^ 
10 TAAGGATACAGTTATTCCTTTTTACCTTGGTTTGCTTGGTTATTGGGGCGTAGCAATCCCT 

4112.2 

ATGAGTACTTTAGCAAAAATAGAAGCGCTCTTGTTTGTAGCGGGTGAAGATGGGATTCGGGTCCGCCAGTTAGCT 
GAACTCCTCTCTCTGCCACCGACAGGCATCCAGCAAAGTTTAGGAAAATTAGCCCAGAAGTATGAAAAGGACCCA 
15 GATTCCAGTTTGGCTTTGATTGAGACAAGTGGTGCITATAGATTGGTGACCAAGCCTCAATTTGCAG 
AGGAATACTCTAAGGCGCCTATCAACCAGAGCTTGTCTCGGGCTGCCCTTC 

ACAGCCGATTACGCGGATAGAAATTGATGCCATCCGTGGAGTTAACTCGAGTGGAGCCTTGGCAAAGTTGCAGGC 
TTTTGACCTGATAAAGGAAGACGGGAAAAAGGAAGTATTGGGGCGCCCCAACCTCTATGTGACTACGGATTATTT 
CCTAGATTACATGGGGATAAACCATTTAGAAGAATTACCAGTGATTGATGAGCTTGAGATTCAAGCCCAAGAAAG 
20 CCAATTATTTGGTGAAAGGATAGAAGAAGATGAGAATCAATAA 



ATGGATACGATGATTAGTAGATTTTTTCGCCATTTATTTGAAGCC^ 
CAGTAGCTGCTGTCAGTTCAGTCATGATTACTTTGACCTTGGTGGCAATATT^ 
25 AAACTAGCTACAGATATTGAAAATAATGTCCGTGTAGTAGTTTATATCCGAAAGGATGTGGAAGATAATAGTCAG 
ACAATTGAAAAAGAAGGTCAAACTGTTACAAATAATGACTACCACAAGGTATATGATTCTTTGAAGAACATGTCT 
ACGGTTAAAAGTGTTACCTTTTCAAGTAAAGAAGAACAATATGAAAAATTAACCGAGATAAT 
AAAATCTTTGAAGGAGATGCCAATCCTCrrCTATGATGCCTATATTGTAGAGGCAAACACTCCAAATGATG 
ACTATAGCCGAAGATGCTAAAAAAATTGAAGGTGTCTCTGAGGTTCAAGATGGCGGTGCCAATACAGAAAGACTC 

30 TTCAAGTTAGCTTCATTTATCCGTGTTTGGGGACTAGGGATTC 

TTCAAATACCATTCGTATTACCATTATTTCCCGCAGTCGCGAAATTCAAATCATGCGCTTGGTCGGAGCTAAAAAC 

AGTTATATCCGTGGACCGTTCTTGTTAGAAGGAGCCTTTATCGGTTTATTC 
CTTTATTGTTTATCAAATTGTTTACCAATCTGTCAACAAATCGTTGGTAGGGCAAAATCT 
GATTTATTTAGTCCGTTGATGATTGCCCTACTATTrGTGATC 
35 CATGCGCCGATTCTTGAAGATTTAG 



4117. 1 

ATGAAGAAAGTAAGATTTATTTTTTTAGCTCTGCTATTTTTCTTAGCTAGTCCAGA 

GTACTTGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCATT 
40 ATCAATCITGGTTCTATATAAAAGCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACT 

TTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAA 
GATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGAC 
TGGGTCTATGATTCTCAATACGATGCTTGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTC 
CAAATTAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATG 

45 TGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGCTTTTTGACAAACA^ 

AAATGGAAACTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATG 

GCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAAA^ 

TGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTT 
GGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGA 
50 GTCAAGCTTGGTACTACITCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTT 
TTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTC 
AAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGATGGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGA 
GGAAAAACTACAAATGAAAATGCTGCTTACTATCAAGTAGTGCCTGTTACAGCCAATGTTTATGATTCAGATGGTG 
AAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGCGCTTGG 

55 CTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATTTAC 

CCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGCTCAGAATGCTAGTATCCCAGTAGCTTCTCAT 
CTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCCTGCATTTTGATGGTTTTAAGCTTGAGAATC 
CCITCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCT 

CATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGCCGAAGAACATTACCATATCAATGCT 
60 TATCTCCTTGCCCATAGTGCCCTAGAAAGTAACTGGGGAAGAAGTAAAATTGCCAAAGATAAGAATAATTTCTTTG 
GCATTACAGCCTATGATACGACCCCTTACCnTTCTGCTAAGACATTTGATGATGTGGATAAGGGAATTTTAGGTGC 
AACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGT 
GGAATATGCTTCAGACCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGGTGG 
CAAAGATTAG 
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4119.2 

ATGAAAAAAGTATTACAAAAATATTGGGCATG 

ATGTGCCGATGTTTCAAGGAGCCTTTTACAGTTTTACCAACT 

CTTAAACAACTTTAAGCTCCTCTTCATGGATCCAA^ 

GCCATGGTGGTTGGTGAGATTGCACTCGGGATCTTCATTGCGCGTGTCTTGAATTCTAAAATCAAAGGCCAAACCT 
TCTTCCGTGCITGGTTCnTCTTCCCAGCTGTTTTATCTGGTTTGACAGTG 
TACGGTCTTCCAGCGATTGGAAATGCCCTTCATATTGAATTT 
CAATCTTTGCGGCTGTCITTGTCCTTCTTTGGCAAGGGGTC 

TCTATTCCAACTGAGATTACAGAGGCAGCAAGGATTGATGGTGCGACTAGCAAGCAAGTTTTCTGGAACATTGAA 

TTGCCTTACTITGCTACCAAGTGTCTCTATGGTCTTTATCCTAGCCCTAAAAGGTGGGCTGACTGCC^ 

CTTTGCCATGACCGGTGGTGGTCCAAACAATGCCACAACCTCACTTGGGCTCTTGGTTTATAACT 

AACAACCAATTCGGTTATGCCAATGCCATTGCCGTAAT<^GTT<^ 

GAGAGTATCTAAGAAATTTGAAATTTAA 

4119.3 

ATGATGAAACAAGATGAAAGAAAAGCCCTGATTGGCAAATACATTCTATTGATTCTAGGATCGGTTCTG 

TGCCGCTCCITGCTACCCTCTTTAGTTCCTTC 

AACTTCACATGGGACAACTTTAGCCGTCTCTTAGCTGATGGGATTGGAGGCTATTATTGGAACTCT 
CTGTCTTGTCTTTACTTGCAGTAATGATCTTT^ 

AAAAGCCnTTACCATCATGTATACCCTCTTAATCCTCGGAATCTTCGTACCTTTCCAAGTCATCATGAT^ 
CGGTTATGATGAGTAAACTCGGTTTGGCTAATACCTTTGGTTTGATCTTGCTCTACT^ 

ACCCTCTTTCTCTATGTTGGCTATATCAAAATCTCGATTCCAGAAAGTCTGGATGAAGCAGCAGAGATCGATGGGG 
CTAATCAATTTACAACCTATTTCCGCATCATCTTCCCAATGATGAAACCGATGCATGCGACAACCATGATCATCAA 
TGCCCTTTGGTTCTGGAATGACTTCATGTTGCCACTCCTTGTCTTGAACCGGGATTCCAAAA 

TGTTCCAATACAACTACGCAGGCCAATATTTCAACGACTACGGACCAAGCTTTGCCTCITACGTGGTCGGCATTAT 
CAGTATCACCATTGTCTATCTCTTCTTCCAACGCCATATCATTTCAGGAATGAGCAACGGGGCAGTGAAGTAA 

4119.4 

ATGAAAAGTATTCTTCAGAAAATGGGGGAGCATCCGATGCTGCTT 

TTCTTGCACAAAATTGGATGGGTCTTGTGGCTTCAGTAGGAA 

TCGATTTTATCCCATAAATTCTTTCGATTGATTTTGCA 

AGTTTAGAACATTTCCAAATTGTGAAGAAATTTAACTATGCTTTTC^ 

ACCGGGCAGAAGTGACCTTCTTTAATCCTAATTATTATGGAATTATTTGTTGTTTCT 

CTGTTTACAACGACCAAGTTGAATTGGTTGAAAGTATTCTGTGTGATTGCAGGCTT TGTTA ATCT 

CirTACTCAAAATCGAACTGCCTTTCCTGCTATTATCGCTGGAGCAATTATCT 

GGAAGGCCTTTTGGCTTAGTATTGGGGTCTTCGCGATTG 

ATGGGTACTTTAGACTCHTCTATGGAAGAACGCATTTCrATCT^ 

CTTTTTGGGGTGAAGGGCCATTGACCTATATGAACTCrTATCCTCGGATACATGCTCCTTATCATGAACATGCCCA 

CAGTCTTTATATTGATACGATTCTGAGTTACGGAATTGTGGGGACT^ 

TTCGCITGATGATGGATATGAGTCAGGAGTCGGGGAAACGTCCGATTATC 

GGTTGCTGTGCACGGAATTTTTGACTTGGCTCTCTTCT^ 

GTATTCCATTGGAGCATCGAATGTTGGTATCGGACATGACGGATTAA 

4120.1 

ATGTCAAAGATGGATGTTCAGAAAATCATTGCACCGATGATGAAGTTTGTGAATATGCGTGGCATTATAGCTCTAA 

AAGATGGGATGTTAGCAATTTTGCCATTGACAGTAGTTGGTAGTTTGTTCnTGATTATGGGACAATTGCCGTTCG^ 

AGGATTAAATAAGAGCATTGCTAGTGTTTTTGGAGCTAATTGGACAGAGCCGTTTATGCAAGTATATTCAGGAAC^ 

TTTGCTATTATGGGTCTAATTTCTTGTTTTTCAATTGCCTATTCTTATGCT 

AGCTGGAGTTCTATCTGTATCTGCATTCTTTATTTTGCTAA 

GGGACGCTATTAGTAAAGTTTGGTTTGGAGGCCAAGGAATTATCGGTGCTATCATTATAGGTTTGGTAGTAGGAAG 

TATTTATACCTTCTTTATAAAGAGAAAAATTGTTATTAAGATGCCAGAACAAGTTCCACAAGCTA 

TTTGAAGCAATGATTCCAGCATTTGTAATTTTCITATCTTCTATGATTGTATATATTTTA 

TGGCGGAACATTCATAGAAATGATTTATTCTGCTATTCAAGTTCCGTC 

GCTATTGGAATTGCATTCITTATATCATTTTTGTGGTGGTTTGGTGTTCATGGGCAATCG 

GACAGCTCTGCTTTTATCTAATCTTGATGCTAATAAAGCTATC 

GCACATATTGTTACTCAACAATTTTTAGATTCATTTTTAATTCTATCAGGTTCAGGGATTACG 

TGCCATGCTTTTTGCAGCAAAATCAAAACAATACCAAGCC^ 

GTAAATGAGCCAGTTGTATTTGGATTTCCGATTGTCATGAATCCAGT^ 

ACTTGCAGCTGTGATAGTATATGGAGCTATTGCAACAGGTTTCATGCAGCCATTCTCAGGGGTAACATTGCCTTGG 
AGTACACCAGCTATTTTATCAGGATTTTTGGTGGGTGGATGGCAAGGAGTTATTACTCAGCTGGTGATATTAGCGA 
TGTCTACATTGGTTTATTTTCCATTCTTTAAAGTACAGGATCGTTTAGCTTACCAAAATGAAATCAA 

4121.2 

ATGAAGAAAAAGGACTTAGTAGACCAACTAGTCTCAGAGATCGAGACGGGGAAAGTCAGGACACTGGGAATATA 
CGGTCATGGAGCTTCAGGTAAATCAACCTTTGCACAGGAATTGTACCAAGCTTTAGATTCTACTACAGTAAATTTG 
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CTAGAGACAGATCCTTATATCACCTCAGGACGCCATCTGGTAGTACCCAAGGACGCGCCGAATCAAAAGGTGACA 
GCCAGTCTGCCAGTGGCGCATGAACTGGAGAGTTTGCAGAGAGATATCCTTgCTTGCAGGCGGGTATGGATGTCTT 
GA 



4122.1 

ATGAAGAAAAGATACCrAGTCTTGACAGCTTTGCTAGCCTTGAGTCTAGCAGCITGTT^ 

AATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAACAGTCGGTAGTAAGTCTCAAGGAGC 

TGCCCAGAAGAAAGCAGAAGTGGTCAATAAAGGTGATTACTACAGCATTCAAGGGAAATACGATGAAATCATCGT 

AGCCAACAAACACTATCCATTGTCTAAAGACTATAATCCAGGGGAAAATCCAACAGCCAAGGCAGAGTTGGTCAA 

ACTCATCAAAGCGATGCAAGAGGCAGGTTTCCCTATTAGTGATCATTACAGTGGTTTTAGAAGTTA 

ACCAAGCTCTATCAAGATTATGTCAACCAAGATGGAAAGGCAGCAGCTGACCGTTACTCTGCCCGTCCTGGCTAT 

AGCGAACACCAGACAGGCTTGGCCTTTGATGTGATTGGGACTGATGGTC 

CAATGGCTCTTGGATCATGCAGCTGATTATGGCTTTGTTGT 

ATATGGCTGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAATTGCTGCAAGTGGTCTCAGT^ 
AAGAATACTATGGCTTTGAAGGCGGAGACTACGTCGATTAA 

4125.6 

ATGCGTAAATTCTTAATTATTTTGTTGCTACCAAGTTTTTTGACCATT^ 
TCGTCTATACTTCGAAAGAAATTTATTACCTTTCACAATCTGACTTTGGTAT^ 
CCCATGGTTTATGGAGAGGTTCCTGTTTATGCGAATGAAGATTTAGTAGTC 
CAAGTTTTCAAATAACCGAGTGGCGCTTAAATAAACAAGGA^ 

AGCTGCGGACAAACGATTTTTATATGATCAATCAGAGGTAACTCCAACAATAAAAAAAGTATGGTTAGAATCT 
CTTTAAACTGTACAATAGTCCTTATGATTTAAAAGA^ 

GACAAGACCATGTTTGTAGAAGGAAGAGAATTTCTACATATTGATCAGGCTGGATGGGTAGCTAAAGAATCAACT 

TCTGAAGAAGATAATCGGATGAGTAAAGTTCAAGAAATGTTATCTGAAAAATATCAGAAAGATTCTTTCT 

ATGTTAAGCAACTGACTACTGGAAAAGAAGCTGGTATCAATCAAGATC 

AACTCTCTTATCTCTATTATACGCAAGAAAAAATAAATGAGGGTCTTTATCAGTTAGAT 

ATCTGCAGTCAATGATTTTCCAGGTTCTTATAAACCAGAGGGAAGT^ 

AGAATATTCTTTAAAGGATTTAATTACGAAAGTATCAAAAGAATCT 

TACATTTCAAACCAATCTGATGCCACATTCAAATCCAAGATGTCTGCCATTATGGGAGATGATTGGGATCCAAAAG 
AAAAATTGATTTCTTCTAAGATGGCCGGGAAGTTTATGGA^ 

TTTGACTAAAACAGATTTTGATAGTCAGCGAATTGCCAAAGGTGTTTCTGTTAAAGTAG 

CTCATAAAATTGGAGATGCGGATGAATTTAAGCATGATACGGGTGTTGTCTATGCAGATT 
ATTTTCACTAAGAATTCTGATTATGATACGATTTCTAAGATAGCCAAGGATGTTTATGAGGTTCTAA 

4125.7 

ATGAAAAAACAAAATAATGGTTTAATTAAAAATCCTTTTCTATGGTTATTATTTATC 

CCAGTATTTCTATTCTGGGAATAACTCAGGAGGAAGTCAGCAAATCAACTATACTGAGTTGGTACAAGAAATTAC 
CGATGGTAATGTAAAAGAATTAACTTACCAACCAAATGGTAGTGT^^ 

AAAACAAGTAAAGAAGAAACAGGTATTCAGTTTTTCACGCCATCTGTTACTAAGGTAGAGAAATTTACCAGCACT 

ATTCITCCTGCAGATACTACCGTATCAGAATTGCAAAAACTTGCTACTGACCATAAAGCAGAAGT^ 

ATGAAAGTTCAAGTGGTATATGGATTAATCTACTCGTATCCATTGTGCCATTTGGAATTCTATTCITCTTCCT 

TCTATGATGGGAAATATGGGAGGAGGCAATGGCCGTAATCCAATGAGTTTTGGACGTAGTAAGGCTAAAGCAGCA 

AATAAAGAAGATATTAAAGTAAGATTTTCAGATGTTGCTGGAGCTGAGGAAGAAAAACAAGAACTAGTTGAAGTT 

GTTGAGTTCTTAAAAGATCCAAAACGATTCACAAAACTTGGAGCCCGTATC 

CTCCGGGGACAGGTAAAACTTTGCTTGCTAAGGCAGTCGCTGGAGAAGCAGGTGTTCCATTCTTTAGTATCT 
TTCTGACTTTGTAGAAATGTTTGTCGGAGTTGGAGCTAGTCGTGTTC 

GCACCAGCTATCATCITTATCGATGAAATTGATGCTGTTGGACGTCAACGTGGAGTCGGTCTCGGCGGAGGTAATG 

ACGAACGTGAACAAACCTTGAACCAACTTTTGATTGAGATGGATGGTTTTGAGGGAAATGAAGGGATTATCGTCA 

TCGCTGCGACAAACCGTTCAGATGTACTTGACCCTGCCCTTTTGCGTCCAGGACGTTTTGATAGAAAA 

TGGTCGTCCTGATGTTAAAGGTCGTGAAGCAATCTTGAAAGTTCACGCTAAGAATAAGC 

(HTTAGCAGAAGATGTTGATTTGAAATTAGTGGCTCAACA^ 

CTTGAATGAAGCAGCITTAGTTGCTGCTCGTCGCAATAAATCGATAATTGATGCTTCAGATATTG 
GATAGAGTTATTGCTGGACCTTCTAAGAAAGATAAGACAGTTTCACAA 

GAGGCAGGACATACCATTGTTGGTCTAGTCTTGTCGAATGCTCGCGTTGTCCATAAGGTTACAATTGTACCACGCG 

GCCGTGCAGGCGGATACATGATTGCACTTCCTAAAGAGGATCAAATGCTTCTATCTAAAGAAGATATGAAAGAGC 

AATTGGCTGGCTTAATGGGTGGACGTGTAGCTGAAGAAATTATCTTTAATGTCCAAACCACAGGAGCTTCAAACG 

ACTTTGAACAAGCGACACAAATGGCACGTGCAATGGTTACAGAGTACGGTATGAGTGAAAAACTTGGCCCAGTAC 

AATATGAAGGAAACCATGCTATGCTTGGTGCACAGAGTCCTCAAAAATCAATTTCAGAACAAACAGCTTATGAAA 

TTGATGAAGAGGTTCGTTCATTATTAAATGAGGCACGAAATAAAGCTGCTGAAATTATTCAGTCAAATCGTGAAAC 

TCACAAGTTAATTGCAGAAGCATTATTGAAATACGAAACATTGGATAGTACACAAATTAAAGCTCTTTACGAAAC 

AGGAAAGATGCCTGAAGCAGTAGAAGAGGAATCTCATGCACTATCCTATGATGAAGTAAAGTCAAAAATGAATGA 

CGAAAAATAA 
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4125.10 

ATGAGGGAACCAGATTTTTTAAATCATTTTCTCAAGAAGGGATATTTCAAAAAGCATGCTAAGGCGGTTCT 
TTTCTGGTGGATTAGATTCCATGTTTCTATTTAAGGTATTGTCTACITATCAAAA 

CTAGCTCATGTGAATCATAAGCAGAGAATTGAATCAGATTGGGAAGAAAAGGAATTAAGGA AGTT GGCTGCTGAA 
GCAGAG(m , CCTATTTATATCAGCAATTTTTCAGGAGAATTTTCAGAAGCGCGTGCACG 

TTTTTCAAGAGGTCATGAAAAAGACAGGTGCGACAGCnTTAGTCACTGCCCACCATGCTGATGATCAGGTGGAAA 

CGATTTTTATGCGCTTGATTCGAGGAACTCGCTTGCGCTATCTATCAGGAATTAAGGAGA^ 

GATAGAAATCATTCGTCCCTTCTTGCATTTTCAGAAAAAA 

AATCAGGAGAATCATTATTTTCGAAATCGTATTCGAAATTCTTACTTACCAGAATTGGA 
TTAGGGATGCAATCTTAGGCATTGGCAATGAAATTTTAGATTA 
TAATGTGGAAGATTTACAGCAGTTATTTTCTTACTCT^ 
GTTTTCCAGATTTGAATCTTACAAAAGCTCAGTTTG 

TCATCCGATTAAAAATGGCTATGAATTGATAAAAGAGTACCAACAGTTTCAGATTTGTAAAATCAGTCCGCAGGCT 
GATGAAAAGGAAGATGAACTTGTGTTACACTATCAAAATCAGGTAGCTT^^ 

TTCCATTAGAAGGTGAATTAATTCAACAAATACCTGTTTCACGTGAAACATCCATACACATTCGTCATCGAAAAAC 

AGGAGATGTTTTGATTAAAAATGGGCATAGAAAAAAACTCAGACGTTTATTTATTG 

ATTTGAAAATCCCTATGGAAAAGAGAAACTCTGCTCTTATTATTGAGCAATTTGGTGAAA 

AATTGCGACCAATAATTTGAGTAAAAAAACGAAAAATGATATAATGAACACTGTACTTTATATAGAAA/^ 

TAGGTAA 

4126.1 

ATGAAGCGTTCTTCTCrnTAGTTAGAATGGTTATTTCCATCTTT 

ACTTTCTACTATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCAG 

ACTAGCCACITTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTTTGACCCAGC 

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGACCATCTTGAAGTCAGA 

TAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTGTGCAGATGAA 

AACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCTGTTTTGACT 

GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAATCTTG 

TGCTTCGTTTGGATATTTCTTATGAAACTCTGGAAGCCTAT 

CTTCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAGCAAAATGGAG 

GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 

GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 

ACCITGCTTGGGGCCAGTGTCACATCTCTTCITGTCTG 

TCCITTGAAGGATTTGAGAGAAACCATGTTGGAAATTC 

GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTC 

GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAGCCAGATTAATCCACATTTCCTCTATAA 

CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 

ACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGACrTGATTTGTCTCTCTGACGAAATCAATC 

ATCTCTTTATCCAGAAACAACGCTATGGAGATAAGCTGGAATACGAAATTAATGAAAATGTTGCCITTGATAA 

AGTCnTACCCAAGCTGGTCCTACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGTCA 

GGGCCATATTAAACTTTCTGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 

CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 

AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGAC 

AT AAAT AG AAT AG AAA CT AG CT AA 



4126.7 

ATGAAGCGTTCTTCTCTTTTAGTTAGAATGGTTATTTCCATCTTTCT 

ACTTTCTACTATCAATCAAGTTCTTCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCAG 

ACTAGCCACTTTATTCAGTCTTATATCAAAAAACTAGAAACCACCTCGACTGGTTTGACCCAGCAGACGGATGTTC 

TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTrTGACCATCTTGAAGTCAGA 

TAAGGACTTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATTTCTACAGATGACAGTGTGCAGATGAA 

AACITCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAG 

GCTCGTAAATCAGATAGTCAGTGGGTCATTTCTGTCACTCAAGAACTTGTTGATGCAAAGGGAGCCAATCTTGGTG 

TGCTTCGTTTGGATATTTCTTATGAAACTCTGGAAGCCTATCTCAATCAACTCCAGTTGGGGCAGCAGGGCTTTGC 

CrrCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGTTCGTCTAGCAAAATGGAG 

GCTATGAAACCCTACATCGATACAGGTCAGGGTTATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 

GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 

ACCTTGCTTGGGGCCAGTGTCACATCTCTTCTTGTCTGTCTCTGCTTAGTGTGGTTCAGTCTTAAACGCTGGATTGC 

TCCTTTGAAGGATTTGAGAGAAACCATGTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTGCCAAGGA^ 

GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATCAGATTGATCAGTTGATGGTAGCTATTC 

GTAGCCAGGAAGAAACGACCCGTCAGTACCAACTTCAAGCCCTTTCGAGCCAGATTAATCCACATTTCCTCTATAA 

CACTTTGGACACCATCATCTGGATGGCTGAATTTCATGATAGTCAGCGAGTGGTGCAGGTGACCAAGTCCTTGGCA 

ACCTATTTCCGCTTGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTGACGAAATCAATCATGTCCGCCAGT 

ATCTCTTTATCCAGAAACAACGCTATGGAGATAAGCTGGAATACGAAATTAATGAAAATGTTGCCTTTGATAATTT 
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AGTCTTACCCAAGCTGGTCCTACAACCCCTTGTAGAAAATGCTCTTTACCATGGCATTAAGGAAAAGGAAGGTCA 

GGGCCATATTAAACTTTCT.GTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGTTGGCTTC 

CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 

AAACTTCATTTTGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGA 

ATAAATAGAATAGAAACTAGCTAA 

4127.4 

ATGTTTTTTAAATTATTAAGAGAAGCTCTTAAAGTCAAGCAGGTTCGATCAA 

TTTGGTCTTTCGTATCGGAACTAGCATTACAGTTCCTGGTGTGAATGCC 

CCITCTTAAACATGTTGAGCTTGGTGTCGGGGAATGCCCTAAAAAACTTTTCGATT^ 

CTATATCACCGCTTCTATTGTTGTCCAACTCTTGCAAATGGATATTTTACCCAAGTTTG 

GGGGAAGTAGGTCGAAGAAAATTGAATCAAGCTACTCGTTATATTGCTCT 

GGATTACAGCTGGTTTTAATACCTTGGCTGGAGCTCAATTGATTAAAACTGCTTTAACT 

GATTGGTATCATCTTAACAGCTGGTAGTATGATTGTCACITGGTTGGGTGAGCAAATTACAGATAAGGGATACGGA 

AACGGTGTTTCCATGATTATCTTTGCCGGGATTGTTTCCTCAATTCCAGAGATGATTCAGGGCATCT 

ACTTTGTGAACGTCCCAAGTAGCCGTATCACITCATCT^ 

ATTATTTACITTACAACTTATGTTCAACAAGCAGAATACAAAATTCCAATCCAATA 
CTCCATCTAGCTCTTACCTTCCGTTAAAAGTAAACCCTGCT 

GCAGCGCCTGCGGCTATTCTTCAGTTTTTGAGTGCCACAGGTCATGATTGGGCTTGGGTAAGGGTAGCACAAGAGA 
TGTTGGCAACTACTTCTCCAACTGGTATTGCCATGTATGCTTTGTTGATTATTCTCT^ 

TACAGATTAATCCTGAAAAAGCAGCAGAGACCTACAAAAGAGTGGTGCCTATATCCATGGAGTTCGTCCTGGTAA 
AGGTACAGAAGAATATATGTCTAAACTTCTTCGTCGTCTTGCAACT^ 



4127.5 

ATGGATATTAGACAAGTTACTGAAACCATCGCCATGATTGAGGAGCAAAACTTCGATATTAGAACCATTACCATG 

GGGATTTCTCTTTTGGACTGTATCGATCCAGATATCAATCGTGCTGCGGAGAAAATCTATCAAAAAATTACGACAA 

AGGCGGCTAATTTAGTAGCTGTTGGTGATGAAATTGCGGCTGAGTTGGGAATTCCTATCGTTAATAAGCGTGT 

GGTGACACCTATTTCTCTGATTGGGGCAGCGACAGATGCGACGGACTACGTGGTTCTGGCAAAAGCGCTTGATAA 

GGCTGCGAAAGAGATTGGTGTGGACTTTATTGGTGGTTTTTCTC 

GAGATTCTCATCAATTCCATTCCTCGCGCTTTGGCTGAGACGGATAAGGTCTGCTCGTCAGTCAAT ATCG GCT 
CCAAGTCTGGTATTAATATGACGGCTGTGGCAGATATGGGACGAATTATCAAGGAAACAGCAA ATCTT TCAGATA 
TGGGAGTGGCCAAGTTGGTTGTATTCGCTAATGCTGTTGAGGACAATCCATTTATGGCGGGTGCCTTTCATGGTGT 
TGGGGAAGCAGATGTTATCATCAATGTCGGAGTTTCTGGTCCTGGTC^ 

GGACAGAGCTTTGATGTAGTAGCCGAAACAGTTAAGAAAACTGCCTTTAAAATCACTCGTATCGGTCAATTGGTT^ 
GTCAAATGGCCAGTGAGAGACTGGGTGTGGAGTTTGGTATTGTGGACITGAGTTTGGCACCAACCCCTGCGGTTGG 
AGACTCTGTGGCACGTGTCCTTGAGGAAATGGGGCTAGAAACAGTTGGCACGCATGGAACGACGGCTG CCTTG GC 
CCTCITGAACGACCAAGTTAAAAAGGGTGGAGTGATGGCCTGCAACCAAGTC 

CCTGTTTCTGAGGATGAAGGAATGATTGCTGCAGTGCAAAATGGCTCTCTTAATTTAGAAAAACTAGAAGCT 

CGGCTATCTGTTCTGTTGGATTGGATATGATTGCCATCCCAGAAGATACGCCTGCTGAAACTATTGCGGCT 

TGCGGATGAAGCAGCAATCGGTGTTATCAACATGAAAACAACAGCTGTTCGTATCATTCCCAAAGGAAAAGAAGG 

CGATATGATTGAGTTTGGTGGTCTATTAGGAACTGCACCCGTTATGAAGGTTAATGGGGCTTCGTCTGTCGACTTC 

ATCTCTCGCGGTGGACAAATCCCAGCACCAATTCATAGTTTTAAAAATTAA 

4128.1 

ATGACACAGATTATTGATGGGAAAGCTTTAGCGGCCAAATTGCAGGGGCAGTTGGCTGAAAAGACTGCAAAATTA 

AAGGAAGAAACAGGTCTAGTGCCTGGTTTGGTAGTGATTTTGGTTGGGGACAATCCAGCCAGCCAAGTCTACG^ 

CGCAACAAGGAGAGGTCAGCCCTTGCGGCTGGTTTCCGTAGCGAAGTAGTACGGGTTCCAGAGACCATTACTCAA 

GAGGAATTGTTAGACCTGATTGCTAAATACAATCAGGATCCAGCTTGGCATGGGATTTTGGTTCAGTTGCCATTAC 

CAAAACACATTGATGAAGAGGCGGTTCTATTGGCTATTGACCCAGAAAAGGATGTGGATGGTTTCCATCCTCTAA 

ACATGGGGCGTCTTTGGTCTGGTCATCCAGTCATGATTCCITCGACACCGGCAGGAATTATGGAAATGTTCCATGA 

ATATGGGATTGACTTGGAAGGTAAAAATGCAGTCGTCATCGGTCGATCCAATATTGTCGGAAAACCTATGGCCCA 

GCTTCTTTTGGCAAAGAATGCAACAGTAACCTTGACTCACTCACGTACTCATAATCTTTCCAAGGTGGCT 

GCAGATATTCTGGTTGTTGCAATCGGTCGTGCCAAGTTTGTGACTGCTGACTTTGTCAAACCAGGTGCGGTAGTCA 

TTGACGTTGGGATGAACCGCGATGAAAATGGTAAGCTCTGTGGGGATGTTGATTATGAGGCGGTTGCCCCACTTGC 

TAGCCACATTACGCCAGTCCCTGGAGGTGTCGGTCCTATGACCATTACTATGCTGATGGAGCAAACCTATCAGGCA 

GCACTTAGGACATTGGATAGAAAATAA 

4128.2 

ATGTCTAAATTTAATCGTATTCATTTGGTGGTACTGGATTCTGTAGGAATCGGTGCAGCACCAGATGCTAATAACT 

TTGTCAATGCAGGGGTTCCAGATGGAGCTTCTGACACACTGGGACACATTTCAAAAACAGTTGGTTTGAATGTCCC 

AAACATGGCTAAAATAGGTCTTGGAAATATTCCTCGTGAAACTCCTCTTAAGACTGTAGCAGCTGAAAGCAATCC 

AACTGGATATGCAACAAAATTAGAGGAAGTATCTCTTGGTAAGGATACTATGACTGGACACTGGGAAATCATGGG 

ACTCAACATTACTGAGCCTTTCGATACTTTCTGGAACGGATTCCCAGAAGAAATCCTGACAAAAATCGAAGAATTC 

TCAGGACGCAAGGTTATTCGTGAAGCCAACAAACCTTATTCAGGAACGGCTGTTATCTATGATTTTGGACCACGTC 
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AGATGGAAACTGGAGAGTTGATTATCTATACTTCAGCTGACCCTGTTTTGCAGATTGCTGCCCACGAAGACATTAT 

TCCTTTGGATGAATTGTACCGTATCTGTGAATACGCTCGTTCGATTACCCTTGAGCGTCCTGCCCTTCTTGGTCGCA 

TCATTGCTCGCCCITATGTAGGTGAACCAGGTAACTTCACrCGTACGGCAAACCGTCGTGACTTGGCTGTATCTCC 

ATTTTTCCCAACTGTTTTGGATAAATTGAATGAGGCTGGTATCGATACITATGCTGTGGGTAA 

TTTAACGGTGCTGGTATCAACCATGACATGGGTCACAACAAGTCAAATAGTCATGGAATTGATACACTATTGAAG 

ACTATGGGACTTGCTGAGTTTGAAAAAGGATTCTCATTCACAAACCTAGTTGACTTTGATGCCCTTTACGGCCATC 

GTCGTAATGCTCACGGTTACCGTGATTGCTTGCATGAGTTTGATGAACGCTTACCTGAAATTATCGCAGCTATGAG 

AGAGAATGACCTTCTCTTGATTACTGCGGACCATGGAAATGACCCAACGTATGCAGGAACGGATCACACTCGGGA 

ATATATTCCATTGTTGGCCTATAGCCCTGCCTTTAAAGGAAATGGTCTCATTCCAGTAGGACATTTTGCAGATAT^ 

CAGCGACTGTTGCCGATAACTTTGGTGTGGAAACTGCTATGATTGGGGAAAGTTTCrTAGA 

4129.2 

ATGTTTATTTCCATCAGTGCTGGAATTGTGACATTTT^ 

TAGAAAGGCGCAAATTACAGGCCAGCAGATGCATGAGGATGTCAAACAGCATCAGGCAAAAGCTGGGACTCCTA 
CAATGGGAGGTTTGGTTTTCITGATTACTTCTGTTTTGGTTGCITTC 

ATAATGTGGGAATGATTTITGTTCATCTTGGTCTTGTATGGCTTGGTCGGATTTTTAGATGACnTTCT 

CGTAAAATCAATGAGGGGCTTAATCCTAAGCAAAAATTAGCTCTTCAGCTTCTAGGTGGAGTTATCTTCT 

TCTATGAGCGCGGTGGCGATATCCTGTCTGTCTTTGGTTATCCAGTTCATTTGGGAT^ 

(nTTTCTGGCTAGTCGGTTTTTCAAACGCAGTAAACTTGACAGACGGTGTTGACGGTTTAGCT 

GATTAGTTTGTCTGCCTATGGAGTTATTGCCTATGTGCAAGGTCAGATGGATATTCTTCTAGTGATTCTTGCCATGA 

TTGGTGGTTTGCTCGGTTTCTTCATCTTTAACCATAAGCCTGCCAAGGTCTTTATGGGT 

CTAGGTGGGATGCTGGCAGCTATCTCTATGGCTCTCCACCAAGAATGGACTCTCTTGATTATCGGAATTGTGTATG 
V1T1TGAAACAACTTCTGTTATGATGCAAGTCAGTTATTTC 

GCCTGTACATCACCATTTTGAGCTTGGGGGATTGTCTGGTAAAGGAAATCCTTGGAGCGAGTGGAAGGTTGACTTC 
TTCTTTTGGGGAGTGGGACTTCTAGCAAGTCTCCTGACCCTAGCAATTTTATATTTGATGTAA 

4133.1 

TTGTTTAAGAAAAATAAAGACATTCTTAATATTGCATTGCCAGCTATC 
GAATGGTGGACAGTTATTTGGTTGCTCATTTAGGATTGATAGCTATTTCAGGGGTT^ 
CACCATTTATCAGGCGATTTTCATCGCTCrGGGAGCTGCTATTTCCA 
GACCAGTCGAAGTTGGCCTATCATGTGACTGAGGCGTTGAAGATTACCTTACT^ 

GTCCATCTTCGCTGGGAAAGAGATGATAGGACTTTTGGGGACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGTA 
TCTATCTTTGGTAGGCGGATCGATTGTTCTCTTAGGTTTAATGACTAGTCTAGGAGCCITGA 

AATC C ACGTCTG C CTCTCT ATGTT AGTTTTTT ATCC AATG CCTTG AAT ATTCnTTTTC AAGTCT CT ATTTTTGTT 

CTGGATATGGGGATAGCTGGTGTTGCTTGGGGGACAATTGTGTCTCGTTTGGTTGGTCTTGTGATTTT 

AATTAAAACTGCCTTATGGGAAGCCAACTITTGGTTTAGATAAGGAACTGTTGACCTTGGCTT^ 

AGAGCGACTTATGATGAGGGCTGGAGATGTAGTGATCATTGCCTTGGTCGTTTCITTTGGGACGGAGGCAGTTGCT 

GGGAATGCAATCGGAGAAGTCTTGACCCAGTTTAACTATATGCCTGCCTTTGGCGTCGCTACGGCAACGGTCATGC 

TGTTGGCCCGAGCAGTTGGAGAGGATGATTGGAAAAGAGTTGCTAGTTTGAGTAAACAAACCTTTTGGC 

GTTCCTCATGTTGCCCCTGTCCITTAGTATATATGTCITGGGTGTACCATTAACTCATCrCT 

TAGCGGTGGAGGCTAGTGTTCTAGTGACACTGTTTTCACTACTTGGGACCCCTATGACGACAGGAACAGTCATCTA 
TACGGCAGTCTGGCAGGGATTAGGAAATGCACGCCTCCCTTTTTATGCGACAAGTATAGGAATGTGGTGTATCCGC 
ATTGGGACAGGATATCTGATGGGGATTGTGCTTGGTTGGGGCTTGCCTGGTATTTGGGCAGGGTCTCTCTTGGATA 
ATG GTTTTCG CTG GTT ATTTCT ACG CT ATCGTT A CC AGCG CT AT ATG AG CTTG AAAG G AT AG 

4135.2 

ATGCAAACTCAAGAAAAACACTCGCAAGCAGCCGTTCITGGCTTGCAGCACr^ 

ATCCTGGTTCCCATCATGATTGCGACAGCCCTTGGCTATTCAGCTGAGCAGTTGACCTACCTGATTTCTACAGATA 
TCTTCATGTGTGGGGTGGCAACCITCCTCCAACTCCAACTCAACAAAT^ 

TGGAGTTGCATTCCAGTCGGTCGCTCCCTTGATTATGATTGGGCAAAGCCATGGTAGTGGCGCTATGTTTGGTG 

CTTATCGCATCTGGGATTTACGTGGTTCTTGTTTCAGGCATCT^ 

AACAGGATCTGTTATTACCACGATTGGTTTAACCHTGATCCCT^ 

GAGCCAACTGGTCAAAGTCTCTTGCirGCAGCTArrACTGTTCTGATTATCCT 

GATTTATCAAGTCTATCTCTATTTTGATTGGTCTGGTTGTTGGAACTGC 

TTCTCTCCTGTTGCGGTAGCTCCACHTGTCCATGTCCCAACTCCACT 

ATCTATTGTCATGATGTGTATCATCGCAACGGTGTCTATGGTTGAGTCAACTGGTGTTTATCTGGCCTTGTCT 
TCACAAAGGATCCAATCGACAGCACGCGCCTTCGCAACGGATACCGCGCAGAAGGTTTGGCCGTACTTCTCGGAG 
GAATCTTTAACACCTTCCCTTACACCGGATTTTCACAAAACGTTGGTTTGGTTAAATT 
CCTGCCAATCTACTACGCAGCTGGTTTCCTGGTTCTCCnTGGACTGCTTCCTAAG 

TTCCAAGCTCCGTCCTCGGTGGTGCCATGCTGGTAATGTTTGGTTTTGTATCAATTCAAGGGATGCAAATCCTCGC 
CCGTGTTGACTTTGCTAACAATGAACACAACTTCCTTATCGCAGCTGTTTCAATCGCTGCAGGTGTCGG 
AACAGTAATCTCTTTGTCAGCATGCCGACAGCCnTCCAAATGTTCTTCTCAAACGGAATCGTCGTAGCCAGCCTAC 
TCGCTATTGTCCTCAATGCCGTATTAAATCATAAAAAGAAATAA 

4136.2 
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ATGAAAGATAGAATAAAAGAATATTTACAAGACAAGGGAAAGGTGACTGTTAATC 
AAAGACAGTTCCAAGGATTTTCGTGAGTTGATTAAAACCTTGTC 

AAGAAGATGGTAGTCTGACATTAGAAATTAAGAAAAAACATGAGATTACCCTCAAGGGGATTTTTCATGCCCATA 
AAAATGGCTTTGGCTTTGTTAGTCTGGAAGGCGAGGAGGACGACCTTT^ 

TATTGATGGTGATACCGTCGAGGTAGTGATTAAGAAAGTCGCTGACCGCAATAAGGGAACAGCAGCAGAAGCCAA 

AATTATTGATATCCTAGAACACAGTTTGACAACAGTTGTCGGGCAAATCGTTCTGGATCAGGAAAAACCT 

GCTGGCTATATTCGTTCAAAAAATCAGAAAATCAGTCAACCGATTTATGT^ 

GGAACAGAAGTTCTCAAAGTCTTTATCGATAAATACCCAAGCAAGAAACATGATTTCTTTGTCGCGAGTGTTCT 

ATGTAGTGGGACACTCAACGGATGTCGGAATTGATGTTCTTGAGGTCTTGGAATCAATGGACATTGTATCCGAGTT 

TCCAGAAGCTGTTGTTAAGGAAGCAGAAAGTGTGCCTGATGCTCCGTCTCAAAAGGATATGGAAGGTCGTCTGGA 

TCTAAGAGATGAAATTACCTTTACCATTGACGGTGCGGATGCCAAGGACTTGGACGATGCAGTGCATATCAAGGC 

TCTGAAAAATGGCAATCTGGAGTTTGGGGTTCACATCGCAGATGTTTCTTATTATGTGACCGAGGGGTCTGCCCTT 

GACAAGGAAGCCCnTAACCGTGCGACTTCTGTTTACGTGACAGACCGAGTGGTGCCAATGCTTCCAGAACGACTA 

TCAAATGGCATCTGCTCTCTCAATCCCCAAGTTGACCGCCTGACCCAGTCTGCTATTAT 

GGAGATTGATAAACATGGTCGTGTGGTCAACTATACCATTACACAAACAGTTATCAAGACCAGTTTTCGTATGACC 
TATAGCGATGTCAATGATATCCTAGCTGGCGATGAAGAAAAGAGAAAAGAATATCATAAAATTGTATCAAGTATC 
GAACTCATGGCCAAGCTTCATGAAACTTTAGAAAACATGCGTGTGAAACGTGGAGCT 

AAGCGAAGATTTTAGTGGATAAACAAGGTAAGCCTGTTGATATCGTTCTTCGGCAGCGTGGTATTGCCGAGCGGA 
TGATTGAGTCTTTTATGTTGATGGCTAATGAAACAGTTGCCGAACATTTC 

CGAATTCACGAGGAGCCTAAGGCTGAAAAGGTTCAGAAGTTTATTGATTATGCTTCGAGTTTTGGCTT 
ATGGAACTGCCAGTGAGATTAGTCAGGAGGCACTTCAAGACATCATGCGTGCTGTTGAGGGAGAACCTTATGCAG 
ATGTATTGTCCATGATGCTTCTTCGCTCTATGCAGCAGGCTCGTTATTCGGAGCACAATCACGGCCACTATGGACT 
AGCTGCTGACTATTATACTCACITTACCAGTCCAATTCGTCGTTATCCAGACCTTCITGT^ 

ATTACGGCCGTTCTAAGGAAATAGCAGAGCATTTTGAACAAGTGATTCCAGAGATTGCGACCCAGTCTTCCAACC 
GTGAACGTCGTGCCATAGAAGCTGAGCGTGAAGTCGAAGCCATGAAAAAGGCTGAGTATATGGAAGAATACGTGG 
GTGAAGAGTATGATGCAGTTGTATCAAGTATTGTCAAATTCGGTCTCITTGTCGAATTGCCAAACACAGTTGAAGG 
CTTGATTCACATCACTAATCTGCCTGAATTTTATCATTTCAATC 

GGTATCACTTTCCGAGTGGGTCAGCAGATCCGTATCCGTGTTGAAAGAGCGGATAAAATGACTGGAGAGATTGAT 
TTTTCATTCGTACCTAGTGAGTTTGATGTGATTGAAAAAGGCTTGAAACAGTCTAGTCGT 

AGTGGCAGAGGGCGTGATTCAAATCGTCGTTCGGATAAGAAGGAAGACAAGAGAAAATCAGGACGCTCAAATGA 

TAAGCGTAAGCATTCACAAAAAGACAAGAAGAAAAAAGGAAAGAAACCTTTTTACAAGGAAGTAGCT 

GAGCCAAGCATGGCAAAGGGCGAGGGAAAGGTCGTCGCACAAAATAA 

4137.2 

ATGGGCACAACAGGATTTACAATAATTGACITAATTATCTTC 

CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATGGTTCGGTTCCTT 
GTATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCTCrTAT 
TTTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATC 
TAGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAAT^ 

GTTTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGCTGGTTTATCAGTAT^ 

TTGACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTATACTGGTC 

TTATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTATT^ 

TAAAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAA^ 

TCCTAACTTGCITTCAAACTCCATCrTTTTA^ 

TCAAGATTTGGTTCAACGTTTTACTACAACACAAAATATTAAGAAACTTAATAAGATGTTGTTCACA 
TTGTCACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCTT^ 

ATAGTGCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTACCAGTAGGTATCAC 

AGGTTTGATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACTCrGTTG 

TGGACATTGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTAT 

CTCTAGCAGTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCTGCATACGAATGG 

TTCAATAGTTTCATGGGACTTGTACTTGGTCTACITGGTGGTGTATTTATTCTTGGA 

TAAACAAGGTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCITCCTCCAA 
GCTGTTAGCTACTGGGCATATTCATTGATTTCAATCTCTGTATCAGT^ 

TGGAAATAAAGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGA 
AGTTCGTCACTAA 

4138.1 

ATGAAATTTAGTAAAAAATATATAGCAGCTGGATCAGCTGTTATCGTATCCTTGAGTCTATGTGCCTATGCACTAA 

ACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTCAAGTCAGAAAA 

GTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAATTGTAATCAAAATTACAG 

ATCAGGGCTATGTAACGTCACACGGTGACCACTATCATTACTATAATGGGAAAGTTCCTTATGATGCCCTCTTTAG 

TGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGCTGATATTGTCAATGAAGTCAAGGGTGGTTA 

TATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAAAGATGCAGCTCATGCTGATAATGTTCGAACTAAAGAT 

GAAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGTTAACTCTAATGTTGCTGTAGCAAGGTCT 

CAGGGACGATATACGACAAATGATGGTTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATA 
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TCGTTCCTCATGGAGGTCACTATCACTACATTCCCAAAAGCGATTTATCTGCTAGTGAATTAGCAGCAGCTAAAGC 

ACATCTGGCTGGAAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCTAGTGACAATAACACGCAATC 

TGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAAAATCTCCAGAGTCTTTTGAAGGAACTCTATGA 

TTCACCTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCTTTGACCCTGCTAAGATTATCAGTCGTACACCA 

AATGGAGTTGCGATTCCGCATGGCGACCATTACCACTTTATTCCTTACAGCAAGCTTTCTGCCT^ 

TTGCCAGAATGGTGCCTATCAGTGGAACTGGTTCTACAGTTTCTACAAATGCAAAACCTAATGAAGTAGTGTCT 

TCTAGGCAGTCTTTCAAGCAATCCTTCTTCTT^ 

TTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTCCATTACATTCC 

AAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAATAGTCTAGCAACACCTTCTCCATCTCTTCCAATCAAT 

CCAGGAACTTCACATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCTGAAGATGAA 

TCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTATTTCTTCAAGAAGGACTTGACAGAAGAGCAAATTAAG 

GTGCGCAAAAACATTTAG 

4139.1 

ATGAAAAAAAGAGCAATAGTGGCAGTCATTGTACTGCTTTTGATTGGGCTGGATCAGTTGGTCAAATCCTA 
TCCAGCAGATTCCACTGGGTGAAGTGCGCTCCTGGATCCCCAATTTCGTTAGCTTGACCTACCTGCAAAATCGAGG 
TGCAGCCTTTTCTATCITACAAGATCAGCAGCTGTTATTCGCTGTCATTACrCTGGTTGTCGTGA 
GGTATTTACATAAACACATGGAGGACTCATTCrGGATGGTCHTGG 

AAACTTTATTGACAGGGTCAGTCAGGGCTTTGTTGTGGATATGTTCCACCTTGACTTTAT^ 

ATGTGGCAGATAGCTATCTGACGGTTGGAGTGATTATTTTATTGATTGCAATGCTAAAAGAGGAAATAAATGGAA 
ATTAA 

4139.5 

ATGAATACAAATCTTGCAAGTTTTATCGTTGGACTGATCATCGATG 

ATGGTCAAACCTATGCTCTTGCTAAGGAAGAAGGCCAACATACAGTAGGGGATACGGTCAAAGGTTTTGCATACA 

CGGATATGAAGCAAAAACTCCGCCTGACAACCTTAGAAGTGACTGCCACTCAGGACCAATTTGGTTGGGGACGTG 

TCACAGAGGTTCGTAAGGACTTGGGTGTCnTTGTGGATACAGGCCTTCCTGACAAGGAAATCGTTGTGTCACTCGA 

TATTCTCCCTGAGCTCAAGGAACTCTGGCCTAAGAAGGGCGACCAACTCTACATCCGTCTTGAAGTGGATAAG^ 

AGACCGTATCTGGGGCCTCTTGGCTTATCAAGAAGACTTCCAACGTCTTGCTCGTCCTGCCTACAACAACATGCAG 

AACCAAAACTGGCCAGCCATTGTTTACCGTCTCAAGCTGTCAGGAACTTTTGTTTACCTACCAGAAAAT 

TTGGTTTTATTCATCCTAGCGAGCGTTACGCAGAGCCACGTTTGGGGCAAGTATTAGATGCGCGCGTTATTGGT^ 

CCGTGAAGTGGACCGCACTCTGAACCTCTCCCTCAAACCACGCTCCTTTGAAATGTTGGAAAACGATGCTCAGATG 

ATTTTGACTTATTTGGAAAGCAATGGCGGTTTCATGACCTTAAATGACAAGTCATCTCCAGACGACATCAAGGCAA 

CCITTGGCATTTCTAAAGGTCAGTTCAAGAAAGCTTTAGG 

GTTTGGGACAGAGTTGATTTAG 

4139.8 

ATGAAAGATGTTAGTCTATTTTTATTGAAAAAAGTTTTCAAAAGCCGCTTAAACTGG 

ATCTGTACTCGGTGTTACCTTTTATTTAAATAGTCAGACTGCAAACTCACACAGCTTGGAGAGCAGGTTGGAAAGT 

CGCATTGCAGCCAACGAGAGGGCTATCAATGAAAATGAAGAGAAACTCTCCCAAATGTCTGATACCAGCTCGGAG 

GAATACCAGTTTGCTAAAAATAATTTAGACGTGCAAAAAAATCTTTTGACGCGAAAGACAGAAATTCT 

TAAAAGAAGGGCGCTGGAAAGAAGCCTACTATTTGCAGTGGCAAGATGAAGAGAAGAATTATGAATTTGTATCAA 

ATGACCCGACTGCTAGCCCTGGCTTAAAAATGGGGGTTGACCGCGAACGGAAGATTTACCAAGCCCTGTATCCCT 

TGAACATAAAAGCACATACTTTGGAGTTTCCGACCCACGGGATTGATCAGATTGTCTGGATTTTAGAGGTTATCAT 

CCCAAGTTTGTTTGTGGTTGCTATTATTTTTATGCTAACACAACTATTTG^ 

ACAGCTCACITATATCCTGTTTCAAAAGTGACATTTGCAATATCCTCrCITGGAGTTGGA 

TGCTGTTTATCGGAATCTGTGGCTTTTCTTTTCTAGTC 

TACCCAATTTATAGCTTAGTGAATCAAGAAGTAACTATTGGGAAAATACAAGATGTATTATTTCCTGGCTTGCTCT 
TAGCITTCITAGCCTTTATCGTCATTGTGGAAGTTGTGTACT^ 

CTCTTTCTTTCACTCATTGGGATTGTTGGCTTATTGTTTGGTATCCAAACCATTCAGCCTCTTC 
TCTGATTCCCTTTACTTACTTGCGTTCAGTGGAGATTTTATCTGGAAGATTACCTA 
TAAATTGGAGCATGGGAATGGTCTTACTTCCTTGCCTGATTATCTTTTTGCrAT^ 
TGGGGAAGTTCACAGAAAAAAGAATTTTTTAATAGATTCTAG 

4141.1 

ATGATGAAGTTCATATTGGATATTGTTAGTACACCAGCTATTT^ 

TCAGAAGAAGAAATTACCTGATATTATTAAAGGTGGAATTAAGACCTTTGTTGGTTTCTT 
GCAGGAATTGTACAAAATTCTTTAAATCCATTTGGTACCATGTTTC 

GAATAATGAAGCAATTGTAGCTGTAGCTTTAACAACATATGGCTCAGCTACTGCAATGATTATGTTTGCAGGCATG 
GTGTTCAATATCTTAATCGCTCGTTTTACTCGATTTAAATATATTTTTTTAACAGGGCACC 

ATGTATGATTGCGGTCATTTTATCAGTTGCTGGCTTTACTAGCTTGCCTCTCATCTTACTAGGAGGATTAGCACTCG 
GTATTATTATGAGTATTTCCCCAGCATTTGTGCAAAAATATATGGTTCAATTAACTGGAAATGACAAGGTAGCTTT 
AGGTCATTTCAGTTCTTTGGGATATTGGTTGAGTGGTTTTACTGGT^ 
GAGGACATTAAATTTCCAAAGAGTTTAGCTTTTTTACGTGATAGTACTGTTAGTATTA 

TTACATTATTGTAGCTATCTTTGCAGGGTCAGAATATATAGAAAAAGAAATCAGTAGTGGTACAAGTGGTCTAGTT 
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TATGCnTTACAATTAGCAGGTCAATTTGCAGCAGGGGT^ 
AATTGTTCCAGCCTTTAAAGGTATTTCAG 

ATACTTATGCACCCAATGCAGTTCTAATTGGATTTATCTCTA^ 

ATTGCTTCAGGAACGGTTGTTATCTTACCAGGTGTTGTGCCTCATTTCTTCTGTGGAGCGACT 
GAATGCATCTGGTGGTGTTCGTGGAGCCACTATTGGAGCATTTT^ 

TTTTAATGCCAGTTTTGGGAGGACTTGGTTTCCAAGGATCAACTTTCTCAGATGCAGATTTTGGTCT 

ATTTTAGGAATGTTAAATCAATTTGGCTCACAAGCAGGCATTC 

TGGAGTATCCTTTATTAAAAAGCCATCTGCAACGGAGGAATAA 

4142.3 

ATGATTAAAACATTTCTCTCTGCCCTTTCGGTCATTCTCTTTTCTAT 

TCTAATCITAACATTTGGCTATCTACCCAACCTATCrrTGGCACAGATTTATGCCTTCC 

GGCTGCTATTTTAAGTTTCTTATTTTTTTTCCT 

TTTTGCTCTTACTATCGCTCATATTACTATTATTCGG^ 

ACCTTAAAATTAGTAACTTGGAACGTCGCTAATCAAATAGAA^ 

ACGCCGATATGGCTATATTCCCTGAACTAGCTACCAATATCAGAGGTGAGCAAGAAAACCAGAGAATCAAACTAT 
TGTTTCATCAAGTTGGACTTTCTATGGCCAACTATGATATTTrCACTTCTCCAC 

GTGACTGTGATTGTCAAGAAAAGTTATGGTTTCTATACAGAAGCTAAAACTTTTCATACAACACGGTTCGGGACAA 

TTGTATTACATTCGAGAAAACAAAATATACCAGATATCATTGCCTTGCATACTGCGCCTCCTCTGCCAGGTT^ 

GGAAATCTGGAAGCAAGACTTAAACATCATTCATAATCAATTGGCTTCAAA^^ 

GATTTTAATGCAACTATGCGTCATGGAGCACTTGCAAAAATAAGCTCTCATAGGGACGCATTAAATGCACT 

CCITTTGAAAGAGGAACTTGGAATAGCCAAAGTCCAAAACTTTTTA^ 

AAAACCACTACTATGTTAAAGATTTAGACATTGTAAGTTTTCAAAACTCT 

CACATTTTAA 

4142.4 

ATGAATCCAATCCAAAGATCTTGGGCTTATGTCAGCAGAAAGCGACTGAGAA 
TGGTCTTATTGGCCGGAATTTCAGCCTGTTTGACTCTGATGAAG 

ATCACTCAATACATCTTTTTCTATTAAGAAGATAGAGAATGGTCAGACATTCAAGTTGTCAGACCTAGCATCT 
AGCAAGATTAAGGGGCTGGAAAATGTCTCTCCTGAAC^ 

ACTGGCGAGCAGAGCGTGGAGCGTGATGATTTATCAGCTGCAGACAATAACTTGGTTAGCnTAACGGCTCTTGAG 
GATTCATCCAAGGATGTAACCTTTACCAGTTCGGCTTTCA^ 

TCCAAGAAAATCCITATCCACGAAGAATTGGCTAAGAAGAACGGTCTTTCGCTTCATGACAAGATTG 

CTGGTCAGTCTGAATCTGGAAAAGGACAAACAGTAGAGTTTGAGATTATCGGCATCTTTTCTGGTAAAAAA CAAG 

AGAAATTCACAGGCTTGTCHTCTGACTTCAGTGAAAATCAAGTCTTT^ 

GGGCAATAGTGAAGCTCAAGTCAGTGCAGCACGCTTCTATGTAGAAAATCCTAAGGAAATGGACGGACTCATGAA 
GCAGGTAGAAAACTTGGCCTTGGAAAATCAAGGCTACCAAGTCGAAA 

AGACTCAGTTGCAACTTTCCAAACCTTCCTGACCATCTTCCTTTATGGGATGTTGATAGCAGGAGCT 

ATTCTGGTTTTGTCTCTCTGGTTGAGAGAACGGGTCTATGAAGTGGGGATTTTACTTGCACTTGGAAAAGGC 

GCTCGATCrTCCTACAATTCTGTTTAGAGGTAGTTTTGGTATCTCTTGGAGCT^ 

AACGCAATCACAACTTACCTACTCCAAACTCTACTAGCAAGTGGAGATCAGGCAAGCTTACAAGATACACTAGCC 

AAAGCAAGCAGTTTATCAACTAGCATCTTATCTTTTGCAGAATCCTATGTTTTTCTAGTTCT 

TGTAGCCCTTTGTTTCCTATTCTTATTTAGAAAATCACCGAAAGAAATTTTATCATCT 

4142.5 

ATGTTACACAACGCATTTGCCTATGTTACAAGGAAGTTTTTCAAATCGATTGTCATCTTCCTG 

GGCGAGCTTGAGTTTGGTCGGCTTGTCAATCAAGGGAGCTACTGCCAAGGCTTCTCAGGAGACCTTTAAA^ 

ACCAATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGGAACGCCTCGTGGTGCTGGGAATATCAAGGGT 

GAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGAGTCTTATGTCAAACGTATCAACGCTATCGGAGATTTG 

ACTGGATATGACCTGATTGAAACGCCAGAAACCAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGT 

AGCTTGATGATTACAGGTGTCAATGACTCCTCTAAAGAAGACAAGTTTGTCTCTGGTTCTTATAAACTAGTCGAAG 

GAGAGCACTTAACCAACGACGACAAGGATAAAATCCTCTTGCACAAGGACTTGGCAGCCAAACACGGCTGGAAA 

GTAGGGGACAAGGTTAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCCAAGGAAACAGTTGA 

AGTGACAATCAAGGGACTCITTGATGGTCATAATAAGTCAGCAGTAACCTACTCACAAGAACTTTACGAAAACAC 

AGCTATTACAGACATTCACACTGCTGCAAAACTTTATGGATACACAGAAGACACAGCCATTTATGGGGACGCAAC 

CTTCTTTGTAACAGCAGACAAGAACTTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTATCAACTGGAA 

GAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCTCTTGAGCAATCTATCTCTGGTATGTACAAGATGGCC 

AACCTCCTCTTCTGGGGTAGCTTGAGCTTCTCAGTTCTC 

TCGCAAGGAAGTGGGAATTCTCCTCTCTATCGGCCTCAAGCAGGCAAGTATCTTGGGTCA 

ATTCATCACCGAATCTATCTTGATTGCTATCCCTGCTCTAGTTTCTGCTTACTTCCTAGCTAATTACACTGCCCGTG 

CAATTGGAAACACTGTCCTTGCCAATGTGACTTCAGGTGTTGCCAAACAGGCTAGTAAGGCGGCTCAAGCCTCTA 

ACCTTGGTGGTGGTGCAGAAGTAGATGGCTTTAGCAAGACCTTGTCGAGCCTAGACATTTCCATTCAGACATCAGA 

CTTTATCATCATTTTTGTCCTTGCCTTGGTTCTAGTGGTTCTCGTTATGGCGCTTGCTTCAAGCA 

AACAACCAAAAGAGCTCTTGCTGGATGGTGAATAA 
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ATGTCACAGGATAAACAAATGAAAGCTGTTTCTCCCCnTCTGCAGCGAGTTATCAATATCTCATCGATTGTCG 
GGGTTGGGAGTTTGATTTTCTGTATTTGGGCTTATCAGGCTC 

ATCCAGCAGGCAGGCATCrGGGGTCCACCTCTCTTTATCTTTTTACAGATTTTACAGACTG 
5 AGGGGCCTTGACCTCGGTGGCTGGGGTCTTTATCTACGGGCACATCATCGGGACTATCTACAACTATATCGGCATC 
GTGATTGGCTGTGCCATTATCTTTTATCTAGTGCGCCrATACGGAGCTGCCTITGT 
GCACCTACGACAAGTACATCGACTGGCTAGATAAGGGCAATCGTTTTGACCGCTITCTTTAT^ 

GCCCATTAGCCCAGCTGACTTTCTCTGTATGCTGGCTGCCCTGACCAAGATGAGCTTCAAGCGCTACATGACCATC 
ATCATTCTGACCAAACCCTTTACCCTCGTGGTTTATACCT 
10 GCTTTGA 



tITt.i 

ATGAGAAATATGTGGGTTGTAATCAAGGAAACCTATCTTCGACATGTCGAGTCATGGAGTTTCI'lVi'il'ATGGTGA 

TTTCGCCGTTCCTCITITTAGGAATCTCrGTAGGAATTGGGCATCTC 
15 GTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTTAACTTCGACTATAAA 
GACGAAGCAAGTGCCAAAGAAGCAATTAAAGAAGAAAAATTAAAAGGTTATTTGACCATTGATCAAGAAGATAGT 
GTTCTAAAGGCAGTTTATCATGGCGAAACATCGCTTGAAAATGGAATTAAATTTGAGGTTACA 
AACTGCAAAATCAGCTTAATCGTTCAACTGCTTCCTTGTCT^ 

AATTCACAGAAAAGATTGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGGTG 
20 TCIUU'C'T T TATATGATTCTGATTACCTATGCGGGTGTAACAGCTCAGGAAGTTGCCAGTGAAAAAGGCACCAAAAT 
TATGGAAGTCGTTTTTTCTAGCATAAGGGCAAGTCACTATTTCTATGCGCGGATGATGGCT 
TAACGCATATTGGGATCTATGTTGTAGGTGGTCTGGCTGCCGTTTTGCTCTTTAAAGATTTGCCATT 
TCTGGTATTTTGGATCACTTGGGAGATGCTATCTCACTGAATACCITGCTCTTTATTTT 
CGTAGTCTTGGCAGCCTTCCTAGGATCTATGGTTTCTCGT^ 
25 ATTTTGATTATGGGTGGTTTTTTTGGAGTGACAGCTCTAGGT 

TTATATTCCCTTTATTTCGACCTTCTTTATGCCGTTTCGAACGATTAATGACTATGCGGGGGGAGCAG/^ 

ATTTCACTTGCTATTACAGTGATTTTTGCGGTGGTAGCAACAGGATTTATCGGACGCATGTATGCTAGTCT 

TCAAACGGATGATTTAGGGATTTGGAAAACCTTTAAACGTGCCTTATCrTATAAATAG 

30 4144.3 

ATGACAGAAACCATTAAATTGATGAAGGCTCATACTTCAGTGCGCAGGTTTAAj\GAGCAAGAAATTCCCCA^ 

GACTTAAATGAGATTTTGACAGCAGCCCAGATGGCATCATCTTGGAAGAATTTCCAATCCTACTCTGTGATTGTGG 

TACGAAGTCAAGAGAAGAAAGATGCCTTGTATGAATTGGTACCTC 

TCTCTTTGTCGGAGATTTGAACCGAGCAGAAAAGGGAGCCCGACTTCATACCGACACCTTCCAACCCCAAGGTGT 

35 GGAAGGTCrCTTGATTAGTTCGGTCGATGCAGCTCTTGCTGGACAAAACGC 

TATGGTGGTGTGATTATCGGTTTGGTTCGATACAAGTCTGAAGAAGTGGCAGAGCTCTTTAACCTACCTGACTACA 
CCTATTCTGTCTTTGGGATGGCACTGGGTGTGCCAAATCAACATCATGATATGAAACCGAGACTGCCACTAGAGA 
ATGTTGTCTTTGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCTTATGACCGTGTTCAGGCTGACT 
ATGCTGGGGCGCGTGCGACCACAAGCTGGAGTCAGCGCCTAGCAGAACAGTTTGGTCAAGCTGAACCAAGCTCAA 

40 CTAGAAAAAATCTTGAACAGAAGAAATTATTGTAG 

4146.1 

ATGTTAAAACTTATTGCTATTGTTGGAACAAATTCAAAACGTTCTACAAACCGTCAATTGCTTC 
AACACITTACTGACAAAGCTGAAATTGAACTTGTTGAA 
45 GCAAGTACCTGCTGAAATATTGGAAATTGCTGCTAAAATCGAAGAGGCAGATGGCGTTATTATCGGTACTCCTGA 

GTATGATCACTCTATTCCAGCTGTTTTGATGAGCGCTCTTGCTTGGTTGTCCT 

AACCAATCATGATTACAGGTGCTTCTTACGGTACGCITGGTTCATCTCGTGCCCAATTGCAACTTC 
GAATGCTCCTGAAATCAAGGCAAATGTTCTTCCAGATGAATTCTTG 

AGTGGCGACITGGTTGACCTTGATGTTATCAAGAAATTGGATGCCATCTTTGATGACTTCCGTATCTT^ 
50 TCACAGAAAAATTACGTAATGCACAAGAATTACTTCGCAAAGATGCTC 

4146.2 

ATGAATACCTATCAATTAAATAATGGAGTAGAAATTCCAGTATTGGGATTTGGAACITTTAAGGCTAAGGATGGA 
GAAGAAGCCTATCGTGCAGTGTTAGAAGCCTTGAAGGCTGGTTATCGTCATATTGATACGGCGGCGATTTATCAGA 
55 ATGAAGAAAGTGTTGGTCAAGCAATCAAAGATAGCGGAGTTCCACGTGAAGAAATGTTCGTAACTACCAAGCTTT 
GGAATAGTCAGCAAACCTATGAGCAAACTCGTCAAGCTTTGGAAAAATCTAT^ 

TGGATTTGTATTTGATTCATTGGCCGAACCCAAAACCGCTCAGAGAAAATGACGCATGGAAAACTCGCAATGCGG 
AAGTTTGGAGAGCGATGGAAGACCTCTATCAAGAAGGGAAAATCCGTGCTATCGGCGTTAGCAATTTTCTTCCCC 

ATCATTTGGATGCCTTGCTTGAAACTGCAACTATCGTTCCT 
60 TCAAGATCAAGTCGTAGCTTACTGTCGTGAAAAGGGAATTTTATTGGAAGCTTGGGGGCCTTTTGGACAAGGAGA 
ACTGTTTGATAGCAAGCAAGTCCAAGAAATAGCAGCAAATCACGGAAAATCGGTTGCTCAGATAGCCTTGGCCTG 
GAGCTTGGCAGAAGGATTTTTACCACTTCCAAAATCTGTCACAACCTCTCGTATTCAAGCTAATC^ 
GGAATTGAACTGAGTCATGAGGAGAGAGAAACCTTAAAAACGATTGCTGTTCAATCGGGTGCTCCACGAGTTGAT 

GATGTGGATTTCTAG 

65 
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4147.1 

ATGAGGTGCAAAATGCTTGATCCAATTGCTATTC 

TGACAGGCITGATTCITGCGGTTTATTTGACCATGAAA^ 

AGATTTTATCTTAGTAGCCTTTCCCTTGGCTATTTTAGGAGCTCGTCTCTACT 

TAGTCAGAATTTAGGAGAGATTTTTGCCATTTGGAATGGTGGTTTGGCCATTTACGGTGGTTTGATAACT 

CTTGTGCTCTATATCTTTGCTGACCGTAAACTCATCAATACTTGGGATTTTCT 

GATTGCTCAAAGTTTGGGGCGTTGGGGTAATTTCTTT 

TATCTACCTGGCTTTATCCGTGACCAGATGTATATTGAGGGGAGCTACCGTCAACCGACTTTCCITTATGA 
TATGGAATCTGCTTGGCTTTGCCTTGATTCTC 

GGCCITTTACTTGATTTGGTATGGTTTCGGTCGTATGGTTATCGAAGGTATGCGAACAGATAGTCT 

GCITTCGAGTGTCCCAATGGCTGTCAGTTGTCCTTATCGGTCTCGGTATAATGATCGTTATTTATCAAAA 

GAAGGCCCCTTACTATATTACAGAGGAGGAAAACTAA 

4147.2 

ATGGGTAAATTATCCTCAATCCTTTTAGGAACCGTTTC^^ 

GCAAACAAGTTTGCAGTCAGGCTCAAGATTTTCTAGATGATTTGAGAGAAGATCCGGAGTATGCCAAGGAGCAAG 

TCTGTGAAAAACTGACAGAAGTTAAGGAGCAGGCTACAGATTTTGTTCTGAAAACAAAAGAACA 

GTGAAATCACTGTGGACAGTATACTTGCTCAAACTAAATCCTA 

AATTAAATAATCTCAAGGAGCAATGGCAAGAAAAAGCCGAAGCT^ 

TAACAGAAGAATAA 

4147.3 

ATGAAAACTAAATTGATCTTTTGGGGCTCTATGCTCTTTCTCCTCTCCCT 

TTGGATTTTCTATCCTATGGAGATTCAGTGGCTAAACTTAACGAA 

TACAATTTTCATATCTTGATGAATTATCTGACCAATC 

AGCAGCTGGTCTGCACCATTTCGCAGTGGTCAAGAATCTCTTTCATTTGGTTCAGCTAGTAGCTCT 

C C AAGTTTCT ATGTCTTTGTC AAT AG G ATTGTG AAAAAG G A CITTTTGTCTCTTT ATCG AAAAAGT 

AGTAGTCITACCTGTGATGATTGGACITGGGGGAGTTTTGATTGGT^ 

TTCTCITTGTGGGAGATGATACCTGGCTTTTTGATCCAGCCAAGGATCCTGTTATT 

TTTCTTCATGCCirCCTCCTCITTTTTGCCCTCTATG^ 

GTGA 

4149.1 

ATGACITATCATTTTACTGAAGAATACGATATTATTGTAATTC 
CCGCTAGCCGTATGGGCTGTAAGGTCCTG(nTGCGACCATCAATATTG^ 

CTCTATCGGTGGTTCTGCCAAGGGGATTGTCGTGCGTGAAGTCGATGCCCTCGGTGGCGAGATGGCCAAAACCATT 
GACAAGACTTACATCCAGATGAAGATGCTAAACACAGGGAAGGGGCCAGCTGTCCGTGCCCTTCGTGCGCAGGCT 
GACAAGGAACTTTACTCn\AAGGAGATGCGCAAGACGGTTGAAAACC 

ATTGATGAGATTTTGGTGGAAGATGGCAAGGTTGTCGGTGTGCGTACAGCCACCCATCAAGAATATGCTGCTAAG 

GCTGTTATTGTGACGACAGGGACTGCTCTCCGTGGGGAAATTATCATCGGAGACCTCAAGTACTCATCAGGTCCTA 

ACCACAGCTTGGCTTCTATTAACCTAGCTGACAATCTCAAGGAACTGGGTCTCGAAATCGGTCGTTT^ 

AACCCCTCCACGTGTCAAGGCTTCITCTATCAATTACGATGTGACAGAAATTCAGCCAGGAGACGAAGTGCCTAAT 

CATTTCTCATACACTTCACGTGATGAGGATTATGTCAAGGACCAAGTACCATGCTGGTTGACCTATACCAATGGTA 

CCAGTCATGAGATTATCCAAj\ACAACCTCCACCGTGCGCCTATGTTTACAGGTGTGGTCAAGGGAGTGGGGCCTC 

GTTACTGTCCGTCGATTGAAGACAAGATTGTGCGCTTTGCGGACAAGGAACGTCACCAACTCTTCCTTGAGCCAG 

AGGGCGCAATACTGAGGAAGTCTATGTGCAAGGACTTTCAACCAGTCTGCCTGAGGATGTCCAGCGTGACTTGGT 

GCATTCCATCAAAGGTTTGGAAAATGCAGAGATGATGCGGACAGGTTATGCTATTGA 

GTATGATATGGTCnTGCCTCATCAGTTGCGTGCGACTTTC 

CAGACAAATGGAACATCAGGTTACGAAGAGGCAGCAGGCCAAGGGATTATCGCGGGTATCAATGCGGCTCTGAA 

AATCCAAGGCAAGCCTGAATTGATTTTGAAGCGCAGTGATGGTTATATCGGGGTGATGATCGACGACTTGGTGAC 

CAAGGGAACCATTGAACCCTACCGTCTCTTGACCAGTCGTGCTGAATACCGTCTCATTCITCGTCATGACAATGCT 

GATATGCGCITGACTGAGATGGGACGCGAGATTGGCCTTGTGGACGATGAACGCTGGGCTCGTTTTGAAATC 

AAAAATCAATTTGATAATGAGATGAAGCGCCTAGACAGTATCAAACTCAAGCCAGTCAAGGAAACCAATGCCAAG 

GTTGAGGAGATGGGCTTCAAACCCrrGACCGATGCAGTGACAGCCAAGGAAT^ 

ACCAAGATGTGGTGGCCTTCATCGGACCAGCTGCAGAAGACTTGGATGACAAGATTATCGAATTGATTGAAACAG 

AAATCAAGTATGAAGGCTATATTTCCAAAGCCATGGACCAGGTTGCCAAGATGAAACGCATGGAAGAAAAACGCA 

TTCCGGCCAATATCGACTGGGATGACATTGATTCTATCGCAACCGAAGCCCGTCAGAAGTTCAAACTCATCAATCC 

AGAAACCATCGGCCAAGCCAGCCGTATTTCGGGAGTAAACCCAGCAGATATTTCTATTTTGATGGTGTATCTGGAA 

GGTAAAAATCGTAGTATTTCTAAAACTCTTCAAAAATCAAAATGA 



4149.2 

ATGAAAGTATTAGCTTTTGATACGTCCAGCAAGGCTCTTTCTCTGGCTATTTTAGAGGATAAGCAGGTTCTTGC 
AGACGACGATTAATATTAAGAAAAATCACAGTATTACTCTTATGCCTGCCATCGATTTTTTGATGGCAAGTTTGGA 
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TTGGACACCCAAGGATTTGGACCGAATCGTGGTAGCTGAAGGGCCGGGTAGCTATACAGGCTTGCGAATTGCGGT 

AGCAACTGCTAAGACCTTAGCTCACACCCTGAACATCGAGTTC 

CATCAACAAGAAGGTTTGTTTGTCCCCHTGATGGATGCGCGTCGCAATA^ 

CCAAACCTGTCATGGCAGAAGCGCACCTATCTTTTGAAGAGGTGCTAGAAAAAGTCAAGGGTACTAGTCAGGTAA 

CCTTTGTCGGAGAAGTTGGCCCCTITGTTGAGCAGATTCAAAAACACTTGCCAAGGACrGAT^ 

GCCCAATGCAGCTAATCTAGCTCTTTTGGCCTGGGACAAGGAAGCAGACTCCTTGCATGATTTTG 

CTCAAACGAGTCGAGGCTGAGGAAAACTGGCTCAAGAACCATACCGAGTCTGGCGAGTCTTACATTAAACGCCTA 

TGA 

4149.3 

ATGATAGAAATCAAGCGAATTCAACAACAGCCTGACCTAGCTCAAGCCATCTACGCTGTTATGGCAGCTGTTTACC 

TAGTCAGTCCTTGGACTCTGGAGCAAATCCAAGCAGATCTGTCCCAAGACCAGACTTGGTATGCATTGGCTTATGA 

TGGGGCAGAAGTGATTGGATTTCTAGCTGTGCAGGAGAATCTTTTTGAAGCAGAAGTCCTGCAAATCGCTGTCAA 

AGGAGCTTATCAGGGTCAGGGGATTGCGTCagCCTTGTTTGCTCAATTGCCGACAGACAAGGAAATTTTCCT 

GTCAGACAGTCAAATCAACGAGCGCAAGCATTTTACAAGAAAGAAAAGATGACAGTTATCGCTGAGCGAAAGGC 

CTACTACCATGACCCAGTCGAGGACGCCATTATCATGAAGAGAGAAATAGATGAAGGATAG 

4152.2 

ATGACAAAACAAGTCTTATTAGTGGATGATGAAGAACACATTCTGAAATTGCTTGACTACCATTTAAGTAAGGA 
GG CTTTTCT ACTC AATTG GTG AC AAATGG ACGG A AGG C CTT AG CTTTGG C AG A A AC AG AAC CCTTTG ATTTT ATCT 
TGCITGATATCATGTTACCACAATTAGATGGCATGGAAGTTTGTAAGCGGCTGAGAGCCAAAGGCGTCAAAACTC 
CAATTATGATGGTTTCTGCGAAAAGTGATGAATTTGATAAGGTTTTGGCCTTGGAAT^ 

GACCAAGCCTTTTAGCCCTAGAGAATTGCTGGCGCGTGTCAAGGCTGTCCTCAGGCGAACrAAAGGAGAACAAGA 
AGGAGATGATTCAGATAATATCGCTGACGATTCITGGCTATTTGGGACCrrGAAAGTATACCCTGA 
GTCTACAAGGCGAATAAGTTACTGAGTTTGACCCCAAAAGAATTTC 
TCAAAGTTTCGAAAGTAACCGCCCAATAA 

4154.1 

ATGACTACTTTTAAAGATGGATTTTTATGGGGTGGTGCTC^ 

GTGGCAAGGGAATTAGTGTTGCTGATGTTATGACTGCTGGTCGTCATGGAGTAGCTCGTGAAATTACTTTGGGAGT 
TTTAGAGGGTAAATATTATCCAAATCATGAGGCGATAGATTTTTATCACCGTTAT 

GCTGAAATGGGATTCAAGTGCTTCCGTACCTCTATTGCATGGACACGTATCTTTCCAAAAGGTGATGAGTTAGAGC 
CGAATGAAGAAGGATTACAGTTTTATGATAATCTTTTTGATGAATGCTTAAAGAATGGTATTGAA^ 
TCTATCTCATTTTGAAATGCCTTATCACTTAGTGACCGAATATGGTGGTTGGAAAAATAG 
TTTGCTCGTTTTGCAGAAGTCGTATTTAAACGTTACAAAGATAAGGTTAAAT^ 

ATAATCAAGCGAATTATCAGGAAGATTTTGCACCATTTACTAACTCAGGTATTGTATATGAGGAAGGTGATAATAG 

AGAAGCAATTATGTATCAAGCAGCACATTACGAATTAGTTGCTTCTGCACGAGCTGTAAAAATTGGTCATGAGATT 

AATCCAGATTTTCAAATAGGTTGTATGATTGCGATGTGTCCAATTTATCCAGTTACTTGCAATCCTAA 

A ATGG C A ATG AA AGCT ATG C AG AAG CGTT ATT ATTTTG CTG ATGTG C ATGTTTT AGGT A AAT ATC CTG AGC AT ATT 

TTCAAGTATTGGGAACGAAAAGGTATTTCAGTTGATTTTACTGCCCAGGA 

TAGATTACATTGGTTTCAGTTACTATATGTCCITTGCT^^ 

CTTGAAACAGAAGATTTAGTGAAAAATAATTATGTTAAGGCTnrCTGAATGGG 
TTGCGTTATGCGTTAAATTGGTTTACAGACCACTATCACTTACCACT 

AGATCAAGTTGCAGCAGATGGTATGGTACATGATGATTATAGAATTGAATATCTAGGTGCCCATATTCGTGAAATG 
AAAAAGGCTGTAGTTGAAGATGGTGTTGATTTAATGGGTTATACTC^ 

GTACCGGTGAAATGCGGAAACGTTATGGCTTTATTTATGTAGATAAAGATGATAATGGGAAGGGAAGTTATAATC 
GTTCCC CG AA A AA AT CTTTTGG CTG GT AT AAG G A AGTT ATTTC ATCT AACG GTG AATC AGT AG AAT AG 

4154.2 

ATGGATCAACAAAACGGGTTGTTTGGTTTTCITGAAAACCATGTTATGGGACCAATGGGCAAACTTGCT 

AAGTAGTACGTGCTATCACGGCTGCAGGTATGGCTGCTGTACCATTTACTATTGTAGGATCAATGTTTTTGGTATT 

CAGTATTTTGCCACAAGCTTTCTCATTTTGGCCAATTGTGGCAGATATTTTCT 

CACTTTACATGGTTGCAAACTATGCGACTATGGGTTCTCTATCTCTTrATTTCGTTCT 

ACAAAAATTTATGCAGAGGAAGAAGAACTCAATATGAATCCTCTTAATGGTGCCTTGCT 

TCATGACAGTACCGCAAATCATTTTTGATGGTGGAATGATGAAGACTGTGACAAGTCTAAAAGAAGGTGCAGTAA 
TTGCAGATGGATGGGCAATGGGAAATGTAGTCGCACGTTTTGGGACAACAGGGATTTTTACCGCAATCATTATGG 
CAATTGTGACTGTTCITATTTATCGTATGTGTGTTAAACATAATTGGGTTATTAAAATGCCTGAAGCT 
GGAGTTTCTCGTGGATTTACCGCTTTGGTTCCGGGATTTGT^ 

GTAGCAATGGGAACAGATATTTTTAAAGTCATTGCAATTCCATTTGGTTTTGTATCCAATCT 
TTGGTTTAATGATTATTT ATCT ATTGACTCAACTACnTTGGATTGT AGGT ATCCACGGTGCGAACATTGT^ 
TTTGTTAGTCCAATTGCTCTTGCTAACATGGCTGAAAATGCTGCTGGCGGG 

CTAATATGTTTGTAATTGCAGGTGGTTCTGGTGCAACTTTAGGACTATGTTTATATATTGCTTTTGCCT 
GAACAGCTTAAAGCAATAGGACGAGCATCTGTAGTTCCAGCCTTATTTAATATTAATG 

TACCTATTATCTATAATCCAGCCTTGGCT AT ACCATTT ATTTT AGCACCAATGGTTACTGCTACTATTTATTACGTA 
GCGAATTCTCTAAACTTTATTAAGCCAATTATCGCACAGGTTCCATGGCCAACTCCAGTAGGGATTGGAGCTTTCT 
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TAGGGACAGCAGATCTTCGAGCTGTATTAGTTGCTCTAGTATGTC 
ATCCGTGTATATGATCAAAAATTGGTGAAAGAAGAGCAAGGTATCTAA 



5 ATGAAAAAATTTrATGTAAGTCCAATTTTTCCTATTCTAGTAG 
TATTTTTGTTAATAATAATCTGTTGACGGTTT^ 

ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGAAGAAAGTTTGAC 

AGCTCTATTGGAACAGATGCCTGTAGGTGTTATGAAATTGAATTTATCTTCTGGAG 

TATGCTGAATTGATTTTGACCAAGGAAGATGGTGATTTTGATTTAGAAGCTGTTCAAA 

10 TAGGAAATCCGTCTACTTATGCCAAGCHTGGTGAGAAGCGTTATGCTGTTCA 

GTATTTTGTAGATGTATCCAGGGAACAAGCCATAACAGATGAATTGGTAACAAGTAGACCAGTGATTGGGATTGT 
CTCTGTGGATAATTATGATGATTTGGAGGATGAAACTTCTGAGTCAGATATTAGTC 
AATTTTATATCAGAGTTTTCAGAAAAACACATGATGTTTTCTCGTCGGGTA^ 
TGACTACACGGTGCTTGAGGGCTTGATGAATGATAAATTTTCTGTTATTGATGCT^ 

15 AGACAGTTGCCCTTGACCTTAAGTATGGGATTTTCTTATGGCGATGGAAATCATGATGAGATAGGGAAAGTTGCTT 
TGCTCAATTTGAACTTGGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAAGGAAAACGACGAAA 
CAGTTTATTTTGGTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGC 

TTCAGATAAGATTCGGAGTGTAGATCAGGTTTTTGTAGTCGGTCACAAAAATTTAGACATGGATGCTTTGGGCT 
GCTGTAGGTATGCAGTTGTTCGCCAGCAATGTGATTGAAAATAGCTATGCTCTTTATGATGAAGAACAAATGTCT 

20 CAGATATTGAACGAGCTGTTTCATTCATAGAAAAAGAAGGAGTTACGAAGTTGTTGTCTGTT 

GGATGGTGACCAATCGTTCTTTGTTGATTCITGTAGACCATTCAAAGACAGCCTTAACATTATCAAAA 
TGATTTATTTACCCAAACCATTGTTATTGACCACCATAGAAGGGATCAGGATTTTCCAGATAATGCGGTTA 
TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGTTGGTAACGGAATTGATTCAGTTCCAGAATTCTAAGAAAAAT 
CGTTTGAGTCGTATGCAAGCAAGTGTCnTGATGGCTGGTATGATGTTGGATACTAAAAATTTCACCTCGCGAGTAA 

25 CTAGTCGGACATTTGATGTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATCCAGGAAATCGCTGC 
GACAGATTTTGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCAGATGTACTAATA 
GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTAGCCATGTCA 
GGTATTGAAGCGAGTTTTGTTCTTGCGAAGAATACACAAGGATTTATCTCTATCTCAGCTCGAAGTCGT 
TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACITTAATTTGGCAGCAGCTCAAATTAAAGA 

30 TAACCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAAAAGGAGAAAGAAGAA 
TGA 

4156.1 

ATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTGCAGGCTGGATTTTGGT 
35 TATATCAGGTTCCCCTAGTGGTTACCTCTATTTTGACTTTAAAAGAAGTAGCCCTGCTACAGTCAGGGCTG ATAG 
TGCTGGCCrTTCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCTCGTAAA 
ClTl'l'l r r AGAGCTAAAGATTTGGCACGTTTGGGCITGAGTTATCTAGTTATTGTCGGG 

ATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGATATGGTTCAAAATAGTTCGT 

TGATTTCCAGTTTCTTCTTGCTAGCCTTGCITGCTCC^ 
40 AAGATTTTCCGAGGCAAGGAGAACTTGGGATTTGTAGTCGGTACGATTGTGTTTGCTTTATTGCATCAACCAA 

ATTTACCTTCTTTATTGATTTATGGAGGTATGTCGACAGTTCTATCTTGGACAGCCTACAA 

AATGTCGATCTTGCTTCACATGATTGTTAATGGGATTGCTTTCTGTTTGTTGGCTCTTGTGGTGA 

CATTAGGAATTTCTGTTTAAATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTGCAG^ 

CTTTTTACrTGCCGTCCTTTTATATCAGGTTCCC^^ 
45 TACAGTCAGGGCTGATAGTTGCTGGCCITTCAATTGTGGTTCTGGCTCTATTTATTATGGGAGCT 

GTT AG CT AGTTTT AATTTTTC 1" 1"! ' f TTT AG AG CT AAAG ATTTGGC ACGTTTGGGCTTG AGTT ATCT AGTT ATTGTCG 

GGTCAAATATACTTGGTTCCATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATT 

TATGGTTCAAAATAGTTCGTTGATTTCCAGTTTCTTCTTGCTAGCCTTGCTTGCTC 

GTCGTGGGATTGTTCCTAAAAAGATTTTCCGAGGCAAGGAGAACTT 
50 TTTATTGCATCAACCAAGTAATTTACCTTCTTTATTGATTTATGGAGGTATGTCGACAGT^ 

ACAAGACCCAACGTTTGGAAATGTCGATCTTGCTTCACATGATTGTTAATGGGATTGCTTTCTGTTT^ 

GTGGTGATTATGAGTCGGACATTAGGAATTTCTGTTTAA 

4156.4 

55 ATGGATACACAAAAGATTGAAGCGGCTGTAAAAATGATTATCGAGGCTGTAGGAGAGGACGCTAATCGCGAGGGC 
TTGCAGGAj\ACACCTGCTCGTGTAGCCCGTATGTATCAAGAGATTTTTTCAGGTCITGGTCAAACAGCAGAGGAAC 
ATTTGTCAAAATCCTTTGAAATTATTGACGATAATATGGTGGTAGAAAAGGATATCTTTTTCCATACCATGTGTGA 
ACACCACITCnTGCCATTTTATGGTAGAGCGCACATT 

CTAGCCCGTACGGTTGAAGTTTATTCGAAAAAACCACAAATTCAAGAACGTTTGAATATCGAAGTGGCCGATGCC 
60 TTGATGGACTATCTAGGTGCTAAAGGAGCCTTTGTTGTCATTGAGGCGGAACATATGTGTATGAGTATGCGTGGTG 
TTAGAAAACCAGGCACTGCAACCTTGACGACAGTAGCTCGTGGTCTATTTGAAACAGATAAGGATCTCCGTGACC 
AAG CTT ATCGTTT AATGG GG CT AT AA 

4157.2 
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ATGAAAGACTTGTTTTTAAAGAGAAAGCAGGCCT 

ACCACTTTGTCTTGTTCCTGCTTGTCCTGTTGGGCTTTCTAGCCTAC 

GAAAATCATTGGCCTATCCITTTGTTTGTAGGAATTACGTCTGTTTTACTT^^ 

TATGGAGGCTCCAGACAAGCTCTTTCTCITAGTTGGAGAAGAGGAAATTAAGCTCCA 

CATTTCCCTAGTCTTTTGGCTCTTTGTACAGACCCTTTTCT^ 

TGGCTTGCCAGTTTTTCTGCTCTATGTGCTTTTATTGGGGGTAGGAAAATATTTC 

AATTTTTCACTGAAACTGGACTGGACTGGGACTATGTTATTTCT 

TTTCTTTGCCCTCTTTACGCAGGTCAAGGGAATTTCAAACAGCGTTAAGCGTCGTGCCT 

AGGCTGTTCAGAAGGTGCCTGGGAAGATTTGGCAAAATCTCTATCTGCGTTCTTATCTGCGAAATGGCGACCT 

TGCTCTCAGTCTTCGTCTTCTCTTGCnTTCCTTGCTGGCGCAGGTTTTTATCGA 

TGGTAGTTCTCTTTAACTACCTCTTGCTCTT^^ 

CAACT(nTTCCGCTGGACAAGGGGCAAAAGGAAAAAGGCTTACAGGAGGTAGTTCGAGGATTGACCAGTT^ 

TTACTTGTGGAATTAGTTGTTGGGTTGATTACCTTCCAAGAAAAACTAGCCCTTCrA 

TGGTTTTACTAGTCTTGTATTTGCCTTATCAGGTAAAACGTCAGATGCAGGACTAA 

4158.1 

ATGAGAAAATCAATAGTATTAGCGGCAGATAATGCCTATCTTATTCCTTTAGAGACGACTATAAAGTCT 
ATCACAATAGAGATGTTGATTTTTATATTCTCAACAGTGA 

AATGGAAGTTGTGAATTCTACAATTCGCAGTGTACACATTGATAAAGAACTTTTTGAAAGCT 

CATATAAATTATGCTTCTTACTTTAGATTTTTTGCGACAGAAGTGGTTGAATCTGATAG 

CGATATCATTGTAACTGGGGAACTAGCTACTTTGTTTGAGATAGATCTCAAAGGATATTCAATTGGTG 

GATGTCTATGCCTATGAAGGACGAAAATCTGGATTTAATACTGGTATGTTACTAATGGATGTTGCAAAGTGGAAAG 

AACATTCTATTGTCAATAGTTTATTGGAATTAGCGGCCGAGCAGAATCAAGTTGTTCATCTTGGGGATCAGAGTAT 

TTT AAAT ATTT ATTTTG AG G AT AATTG G CT AG CCTT AG AT AA A AC AT AT A ATT AT ATGGTG GGT ATTG AT ATTT ATC 

ACCTTGCTCAAGAATGTGAACGTCTAGATGACAATCCACCTACAAT^ 

GAATACATATAGTATATCTAGACTACGTGAATTATGGTGGGTTTATAGAGATTTGGATTGGTCAGAGATTGCTTTT 
CAACGTTCCG ATTT AAATT ATTTTG AAAGAAG C AATC AGTCT AAAAAAC AAGTG ATG CTTGTG AC ATG G AGTG C A 
GATATAAAACATTTAGAGTATTTAGTACAACGGTTACCTGATTGGCATTTTCATTTGGCT 

TGAGGAGCTGACCTCTCTATCACAGTATACGAATGTAACAGTATATCAAAATGTATTACATAGTAGAATTGATTGG 
CTATTGGACGATTCTATAGTTT ATTT AGAT ATT AATACAGGTGGAGAGGTTTTTAATGTAGTTACAAGGG 
AAAGTGGCAAGAAAATCTTCGCTTTTGATATCACACGTAAAAGTATGGATGATGGACTCTATGACGGTATi i TTTC 
TGTGG AG AG A CC AG ATG ATTT AGTG G AT AG AATG AAG AAT AT AG AG AT AG AGT AA 

4158.2 

ATGACTAAGATTTATTCGTCAATAGCAGTAAAAAAAGGACTATTTACCTCATTTCTACT 

GAAGTCGTATTATTCTCCCITTTGTTGACCTAAATACTAAAGATTTTTT 

TCAGCCGCCCTAACAGGTGGGAATCTAAGAAGTTTATCAATTTTTTCTGTTGGATTATCC 

TGATTTTATGGCAGATGTTTTCITTTTCTAAACGGTTGGGTTTAACATCT 

AAAATGTACCTGACCTTGCTAATTGCTGTGATTCAATCCTTGGCAGTTAGCTTG 

ATTCTGCAATATTGGTTGTTCrAATGAATACAATATTGCT^ 

TTAAATGCGAGTATGGGGATTGGAGGTTCTATTGTAATCCTCCTATCCAGTATGGTTTTAAATATTCCT 

TTTTGGAAACATTTCAGACAGTACACATTCCAACAGGGATTATTGTGT^ 

TATITACTTGCCCTTATGTATCGAGCTCGCTATTTGGTTCCT 

CTATTCTTATCTCGAAATCATGTTGAATCCTGCAGGTGGGATGCCITATATGTAT^ 

CAGCTTATTTGTTCATCTTGTTGGGATTTATTTTCCCTAATCAT^ 

GTTGGAAAGCCnTTGTGGGTCTATGTTTATATTTCGGTCTTATTTTTATT^ 

AATGGAGAAGAGATTGCAGACCGTATGAAAAAATCTGGAGAATACATTTATGGTATTTATCCAGGTGCGGATACT 

AGTCGATTTATTAATCGATTGGTCCTTCGTTTCTCAGTCATAGGTGGTCTCnTTAATGTGATTATG 

CATGCTTTTTGTTTTGTTTGATGAAAAG 

TGATTTTTACGATTAGAGACGAGGTCAAGGCTTTAAGGCTAAATGAGACCTATAGACCTTTGATTTAG 
4158.3 

ATGTCCTCTCTTTCGGATCAAGAATTAGTAGCTAAAACAGTAGAGTTTCGTCAGCGTCTTTCCGAGGGAGAAAGTC 
TAGACGATATTTTGGTTGAAGCTTTTGCTGTGGTGCGTGAAGCAGATAAGCGGATTTTAGGGATC 
TGTTCAAGTCATGGGAGCTATTGTCATGCACTATGGAAATGTTGCTGAGATGAATACGGGGGAAGGTAAGACCTT 
GACAGCTACCATGCCTGTCrATTTGAACGCTTTTTCAGGAGAAGGAGTGATGGTTGTGACT 

TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCGTTTTCTAGGATTGACCATTGGTGTACCATTTACGGAAG 

ATCCAAAGAAGGAGATGAAAGCTGAAGAAAAGAAGCirATCTATGCTTCGGATATCATCrA 

ATTTAGGTTTTGATTATCTAAATGATAACCTAGCCTCGAATGAAGAAGGTAAGTTTT^ 

GATTATTGATGAAATTGATGATATCnTGCTTGATAGTGCACAAACTCCTCTGATTATTGCGGGTTCTCCT 

AGTCTAATTACTATGCGATCATTGATACACTTGTAACAACCTTGGTCGAAGGA^ 

GAAAGAGGAGGTTTGGCTCACTACTAAGGGGGCCAAGTCTGCTGAGAATTTCCTAGGGATTGATAATTTATACAA 
GGAAGAGCATGCGTCTTTTGCTCGTCATTTGGTTTATGCGATTCGAGCTCATA 

TATATCATTCGTGGAAATGAGATGGTACTGGTTGATAAGGGAACAGGGCGTCTAATGGAAATGACTAAACTTCAA 
GGAGGTCTCCATCAGGCT ATTG AAGCCAAGGAACATGTCAAATTATCTCCTGAGACGCGGGCTATGGCCTCG ATC 
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ACCTATCAGAGTCTTTTTAAGATGTTTAAT 

TTTATTGAAACTTACAATATGTCTGTAGTACGCATTCCAACCAATCGTCCGAGACAACGGA 

TTGACTATCCAGATAATCTATATATCACTTTACCTGAAAAAGTGTATGCATCCTTGGAGTACATCAAGCAATACCA 
TGCTAAGGGAAATCCTTTACTCGTTTTTGTAGGCTCAGTTGAAATGTCTCAA CTCT 

AAGGGATTGCCCATAATGTCCTAAATGCTAATAATGCGGCGCGTGAGGCTCAGATTATCTCCGAGTCAGGTCAGA 

TGGGGGCTGTGACAGTGGCTACCTCTATGGCAGGACGTGGTACGGATATCAAGCTTGGTAAAGGAGTCGCAGAGC 

TTGGGGGCTTGATTGTTATTGGGACTGAGCGGATGGAAAGTCAGCGGATCGACCTACAAATTCGTGGCCGTTCT 

TCGTCAGGGAGATCCTGGTATGAGTAAATTTTTTGTATCCTTAGAGGATGATGTTATCAAGAAATTTGGTCCATCT 

TGGGTGCATAAAAAGTACAAAGACTATCAGGTTCAAGATATGACTCAACCGGAAGTATTGAAAGGTCGTAAATAC 

CGGAAACTAGTCGAAAAGGCTCAGCATGCCAGTGATAGTGCTGGACGTTCAGCACGTCGTCAGACTCTGGAGTAT 

GCTGAAAGTATGAATATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGGTTCTCGTGACTTA 

GAGGATGTTGTTGTGGATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTAT 

TGTTTCACTTTATTGTGACCAATATTAGTTTTCATGTTAAAGAGGTTCCAGATTATATAGATG 

GCAGTTCGTAGCTTTATGAAGCAGGTGATTGATAAAGAACTTTCTGAAAAGAAAGAATT^ 

TATATGAACAGTTTTTACGACTTTCACTGCTTAAAGCCATTGATGACAACTGG 

ACAGCTATCCATGGCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTA 
CGCGGGCTTTGAAGCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCCTGATGGGGCTGGTTGAGGT 
CACTCCAAAAGGTGAAATCGTGACTCATTTTCCATAA 

4158.4 

ATGATAGGGACTTTCGCCGCTGCTCTTGTAGCTGTACTAGCAAATTTCATCGTCCCTATTGAAATTACCCCAAATA 
GTGCCAATACTGAAATTGCACCACCAGATGGGATTGGGCAGGTTCTCAGCAACCTCTTGCTCAAACTGGTTGACA 
ACCCAGTCAACGCCCTGCTTACTGCTAACTATATTAGAATCTTAT 

AGAAGCCAGTAAAAATAGTCAAGAATTGCTAAAAACTATCGCTGACGTGACTTCTAAAATTGTCGAATGGATCAT 

CAATCTGGCTCCATTTGGAATCCTTGGTCTTGTTTTTAAAACCATTTCTGAC^ 

TACGGTATTTTATTGGTTCTATTAGTAACGACTATGCTTTTTGTTGCCCCTGTGGTCAAC 

TTTATGAGACGCAATCCTTACCCTCTAGTTTGGAACTGCCTCCGTGTCAGCGGTGTGACAGCCrrri'lCACTCGTA 

GTTCTGCGACTAACATTCCTGTCAACATGAAACTCTGCCAT 

TATCCCACTCGGTTCTACTATCAATATGGCTGGAGTAGCGATT^ 

ACTCTTGGAATTCCTGTTGACTTTGCCACAGCCTTTGTCCTCAGTGTGGTAGCAGCTATCTCATC 
AGGTATTGCCGGAGGTTCCCTCCTTCTTATCCCAGTTGCTTGTAGCC^ 

AAATTGTTGGGGTTGGTTTTGTGATTGGTGTCATCCAAGACTCATGTGAAACAGCCCTTAACTCTTCTACAGATGT 
CCTCTTTACCGCCGTTGCCGAATACGCAGCAACCCGTAAAAAATAA 

4158.5 

ATGTCTATTAGCCAACGTACGACCAAGCTCATCTTAGCTACCTGTCTTGCCTGCCTGCTTGCTTATTTT 

TTCGTCAGCAGTTTCGGCTGGAATTATCGCTCTCTTGAGCCTATCTGATACGCGTAGAAGTACTT^ 

CGCAATCGTCTTTTTTCTATGCTTCTAGCTCTGGCTATCGGTGT^ 

GAGTCTCGGCCTCTATCTGGCCTTCTACGTTCCTTTAGCCTACAAGATGGGCTGGGAAATTGGCATCACA CCAA GC 

ACTGTTTTGGTTAGCCATCTCTTGGTTCAAGAGTCAACCTCTCCAGACC^ 

TATTGGTACAGGATTTGCCTTGCTTGTTAATCT 

CTGGTGGAAGAAAAGTTAAAAGATATCCTCCAGCGCTTCAAATACTATTTATCCAGAGGAGACGGACGCAACCGA 

GCACAGCTGGTAGCAGAATTAGACACGCTTTTGAAAGAAGCCCTCAGACTGGTCTATTTGGATCACTCTGACCACC 

TCTTTCACCAGACAGACTACCATATCCACTACTTTGAGATGAGACAGCGACAAAGTCGTATCCTGAGAAACATGG 

CCCAACAGATTAACACTTGTCACCTTGCCGCCAGTGAAAGCCTGATCTTAGCGCAACTCTTTTCAAAAATTGC 

TCAACTGAGCCAGACCAATCCTGCTTCTGATTTGCTAGATGAAATTGAACGTTATCTGGAAGTCTTCCGGAACCGC 

AGTCTGCCCAAGACAAGAGAAGAATTTGAAACCCGCGCCACCCTTCTTCAACTCCTACGTGAAGCCAAAACCTTC 

ATCCAAGTAAAAGTTGATTTTTACCAAAAATATAGACAGTAA 

4158.6 

ATGGAAATCATGTCGCTTGCGATTGCTGTTTTTGCCGTCATCATTGGTTTAGTCATTGGATATGTCAGCATCTCAGC 

TAAGATGAAATCATCTCAGGAAGCTGCAGAGTTGATGCTTTTAAATGCTGAACAAGAAGCAACTAATTTACGTGG 

ACAAGCTGAGCGTGAAGCGGATTTACTTGTTAATGAAGCCAAACGTGAAAGCAAGTCTCTTAAAAAAGAAGCACT 

ATTGGAGGCCAAAGAAGAAGCCAGAAAATACCGTGAAGAAGTGGACGCTGAATTCAAATCAGAACGTCAAGAAC 

TCAAACAAATCGAAAGTCGTTTGACAGAGAGAGCTACTAGCCTTGACCGTAAGGACGACAATTTGACGAGTAAAG 

AACAAACACTTGAACAAAAAGAACAAAGTATTTCTGATAGAGCGAAAAACCTTGAT^ 

AGGAAGTCGAAAGACAAAAAGAAGCAGAACTAGAGCGTATTGGTGCGCTGTCTCAGGCAGAAGCACGAGATATT 

ATCTTGGCTCAGACAGAGGAAAACTTGACCAGGGAGATTGCCAGTCGCATTCGCGAAGCTGAGCAAGAGGTCAAG 

GAACGTTCTGACAAAATGGCCAAGGACATCTTGGTTCAAGCTATGCAACGTATCGCTGGTGAATATGTAGCGGAG 

TCAACAAACTCAACAGTTCATCTGCCAGACGATACTATGAAGGGACGCATTATTGGTCGTGAAGGTCGTAACATT 

CGTACCTTTGAAAGTTTGACAGGGGTCGATGTGATTATCGACGATACACCAGAAGTGGTGACCTTGTCAGGATTTG 

ATCCGATTCGTCGTGAGATTGCCCGTATGACTATGGAAATGTTGCTCAAAGATGGTCGTATACATCCAGCTCGTAT 

CGAAGAGTTGGTTGAGAAAAACCGTCAAGAGATTGACAATAAGATTCGTGAATACGGTGAGGCTGCTGCCTATGA 

AATTGGTGCGCCAAACCTTCATCCAGACTTGATGAAGATTATGGGACGTTTGCAGTTCCGTACTTCATATGGACAA 

AATGTTTTGCGCCATTCGATTGAGGTTGCTAAGTTGGCTGGTATCATGGCGAGCGAACTTGGTGAAAATGCGGCTC 
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TTGCCCGTCGTGCTGGATTCCTTCACGATATCGGGAAAGCCATTGACCATGAGGTTGAAGGTAGCCACGTTGAAAT 
CGGTATGGAATTGGCCCGTAAGTACAAGGAACCCCCAGTTGTGGTGAATACGATTGCTAGTCACCACGGAGATGT 
TGAAGCTGAGAGCGTGATAGCAGTTATCGTCGCTGCAGCAGATGCCTTGAGCGCAGCCCGTCCAGGTGCTCGTAG 
TGAGTCTCITGAAAGCTACATCAAGCGTCTCCATGATTTGGAAGAAATTGCTAACGGCTTTGAAG 
5 AGCTTTGCCCTTCAAGCAGGACGTGAAATTCGTATCATGGTCAATCCAGGAAAAATCAAGGACGACAAAGTCACA 
ATCnTGGCTCACAAAGTTCGTAAGAAAATTGAAAACAATCTCGATTATCCAGGAAATATCAAGGTAACCGTGA^ 
CGCGAGCTTCGTGCAGTAGATTATGCTAAATAA 

4158.7 

10 ATGATGTTAAAACCCTCTATTGATACCTTGCTCGACAAGGTTCCITCAAAATATTCACT 
AACGTGCCCACGAATTGGAAGCAGGTGCCCCAGCAACTCAAGGT 

TAGAAGAAATCGAATCAGGAAACGTTACAATTCACCCAGATCCAGAAGGAAAACGTGAAGCAGTGCGTCGCCGTA 

TCGAAGAAGAAAAACGCCGCAAAGAAGAAGAAGAAAAGAAAATCAAAGAGCAAATTGCTAAAGAAAAAGAAGA 

TGGTGAAAAAATTTAA 

15 

4161.1 

ATGTCAGCATATCAATTACCGACCGTATGGCAGGATGAAGCTAGTAATCAAGGAGCTTTTACGGGGCTAAACAGA 
CCAACAGCAGGTGCCCGTTTCGAACAAAACTTGCCAAAAGGAGAACAAGCTTTTCAGCnTT 
CCAAATGGTGTGAAGGTTACTATCTTATTGGAAGAATTA(n\\GAA 
20 ATAAGATTGCTATCATGGATGGGGATCAATTCGGATCAGACTTTGTGAAGCTCAATC CAAAT TCCAAGATTCCAGC 
CTTATTGGACCAGTCAGGTACTGAAAACGTAAGAGTCTTTGAGTCTGCTCATATTCrrC^ 
TTTGGAGCCTTTTTACCAAGTAATCCTGTGGAAAAGGTAGAAGTTTTGAATTG 
CACCTTTTCTAGGTGGGGGATTTGGACATTTCTTCAATTATGCTCCTGAAAAAT^ 

ACGATGGAAGTGAAACGCCAGTTGGATTTATTGGATAAGGAATTGGCTCAGAAACCTTATATTGCAGGCAATGAC 
25 TATACGATTGCAGATATTGCTATCTGGTCTTGGTATGGAC^ 

AATTCITGGATGCCTCAAGTTATCAAAATCTAGTAAAATGGGCAGAAAAAATTGCCAATCGTCCAGCT 
GTGGCTTGGAAGTAACTTATACAGAAATTAAATAG 

4161.2 

30 TTGGCAAGCTTGATCACTTCTATCATCATGTTCTATGTCGGTTT 

CAGTCGGGAAGAAACGGTCATTGATCCTCTTGGTGCAACTCTAGGAATCATTTCTGCAGCGATTATGTTTG 
TATCTCTACAATACTCGCCTCAGTAAGAAATCCAACTCCAATGCGCTGAAGGCAGCTGCTAAGGACAATCTTTCTG 
ACGCTGTTACCTCACTTGGAACCGCCATTGCCATCCTAGCTAGTAGTTTCAATTATCCGATTGTGGATAAACTGGT 
TGCTATCATCATCACTTTCTTTATCTTGAAGACTGCCT^ 

35 GCTTTGACGACCGCCTGCTCGAGGACTACCAAAAGGCTATCATGGAAATTCCCAAAATCAGCAAGGTCAAATCGC 
AAAGAGGTCGCACCTACGGTAGCAACATCTACCTGGATATTACACTAGAGATGAATCCTGACTTGTCTGlll TIGA 
AAGCCATGAAATCGCGGATCAGGTCGAGTCTATGCTGGAGGAGCGTTTTGGCGTCTTTGATACCGATGTCCATATC 
GAACCAGCACCTATCCCTGAGGATGAAATTTTAGACAATGTCTATAAAAAATTGCITATGCGTGAA 
ACCAAGGAAACCAACTAGAAGAACTCTTGACTGATGATTTTGTCTATATTCGCCAAGATGGAGAGCAGATGGATA 

40 AAGAGGCTTATAAGACCAAAAAAGAGTTAAATTCTGCTATCAAGGACATTCAAATTACTTCCATCAGTCA 

CCAAACTCATCTGCTATGAGTTAGATGGTATCATCCATACCAGTATCTGGCGTCGCCACGAAACCTGGCAAAATAT 
CTTTC ATC AAG AAAC C A A AAAAG AAT AG 

4162.1 

45 ATGACAATTAAACTAGTAGCAACGGATATGGACGGAACCTTCCTAGATGGGAATGGACGCTTTGATATGGATCGT 
CTCAAGTCTCTCTTGGTTTCCTACAAGGAAAAAGGGATTTACITTGCGGTAGCT^ 
TAGAAAAATTATTTGCTGGTGTTCGTGATGACATTATTTTCATCGCGGAAAATGGCAGT^ 
TCAGGACTTGTATGAAGCGACTATGTCTCGTGACTTTTATCTGGCAACTTTTGA^ 

GTAGATATCAATAAACTGCTCTTGACGGGTAAGAAGGGTTCATATGTTCTAGATACGGTTGATGAGACCTATTTGA 
50 AAGTGAGTCAGCACTATAATGAAAATATCCAAAAAGTAGCGAGTTTGGAAGATATCACAGATGACATTTTCAAAT 
TTACAACCAACTTCACAGAAGAAACGCTGGAAGATGGGGAGGCTTGGGTAAACGAAAACGTTCCTGGTGTTAAGG 
CCATGACAACTGGCTTTGAATCCATTGATATTGTTCTGGACTATGTCGATAAGGGAGTGGCCATTGTTGAATTAGT 
TAAAAAACrTGGTATCACAATGGATCAGGTCATGGCTTTTGGAGACAATCTTAATGACTTACATATGATGCAGGTT 
GTGGGACATCCTGTAGCTCCTGAAAATGCACGACCTGAAATTTTAGAATTAGCAAAGACTGTGATTGGTCACCATA 
5 5 AGG AACGGTCGGTT AT AGCTT AT ATGG AGGGCTT AT AA 

4162.2 

ATGGCAGATATAAAATTGATTGCATTGGACTTGGACGGGACCTTGCTGACTACTGATAAAAGGCTGACGGATCGT 
ACCAAGGAAACCTTGCAAGCTGCGCGTGATCGTGGTATCAAGGTCGTATTGACAACTGGTCGTCCCTTAAAAGCC 

60 ATGGATTTCTTTCTCCATGAGTTAGGGACTGACGGTCAGGAAGATGAGTATACCATTACTTTTAATGGTGGATTAG 
TTCAGAAAAATACAGGAGAAATCCTTGATAAAACAGTCTTTTCATATGATGATGTGGCACGTTTGTATGAAGAAAC 
AGAGAAATTATCACTGCCTCTTGATGCCATCTCAGAAGGAACAGTTTATCAAATCCAATCGGACCAAGAAAGTCT 
TTATGCCAAATTCAATCCAGCITTGACCTTTGTTCCAGTGGACTTTGAAGACTTATCTAGTCAAATGACCTACAAC 
AAATGCGTGACTGCCTTTGCTCAAGAACCCnTGGATGCAGCCATTCAGAAGATTTCTCCAGAATTGTTTGACCAAT 

65 ATGAAATCTTTAAATCACGTGAAATGTTGCTAGAATGGTCACCAAAGAATGTTCATAAAGCAACAGGTTTGGCAA 
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AACTAATCAGCCATCTTGGAATCGACCAAAGTCAAGTGATGGCTTGTGGTGACGAGGCCAATGACCTCTCTATC 

TTGAATGGGCAGGTCTTGGTGTTGCTATGCAAAACGCTGTTCCTGAAGTAAAGGCAGCCGCAAATGTAGTGACGC 

CGATGACCAACGATGAGGAAGCTGTCGCCTGGGCTATCGAAGAATATGTGCTAAAGGAGAACTAA 

5 4164.2 

ATGGAAAGTTTACTTATTCTATTATTAATTGCCAATCTAG 

AGGAGAAACACTTAAGTAAGAGCITGGAGGATCAGGCAGATCATTTGTCAGACCAGTTGGATTACCGCTTTGACC 

AAGCCAGACAAGCCAGCCAGTTAGACCAAAAAGATTTGGAAGTGGTTGTCAGCGACCGTTTGCAAGAAGTGCGGA 

TTGAATTGCACCAAGGTCTGACCCAAGTCCGTCAAGAAATGACAGATAATCTCCTCCAAACTAGAGACAAGACAG 

10 ACCAACGTCTCCAAGCCITGCAGGAATCAAATGAGCAACGTTTGGAACAAATGCGCCAGACGGTCGAGGAAAAAC 
TAGAAAAGACCTTGCAGACACGCTTACAGGCTTCCTTTGAGACAGTTTCTAAACAA 
CCTTGGAGAAATGCAGACAGTTGCCCGTGATGTCGGAGCTCTTAACAAGGTTCTCTCTGGAA 
GATTCTGGGAGAATTGCAACTGGGGCAAATTATTGAAGACATCATGACACCTGCCCAGTACGAACGAGAATACGC 
AACGGTTGAAAACTCTAGTGAACGAGTGGAGTATGCCATCAAGTTACCCGGACAAGGCGACCAAGAATACGTCTA 

15 TCTGCCAATTGACTCTAAGTTTCCACTGGCAGATTATTACCGCTTGGAAGAAGCCTATGAGACAGG 

GAGATTGAACGCTGTCGTAAGTCACTCCTAGCAAGCGTCAAGCGCTTTGCTAGGGATATTAGGAACAAGTACATA 
GCACCACCTCGGACGACCAATTTTGGAGTTTTGTTTGTTCCGACAGAAGGTCTCTACTCAGAAATCGTCCGCAATC 
CGGTCHTCTTTGATGATTTGAGACGGGAAGAACAGATTATTC^ 

CTCCCTATCAGTTGGTTTCAAGACCCTTAATATCCAAAAGAGTGCCGACCATATCAGCAAGACTC^ 
20 AAGACCGAGTTTGGCAAGTTTGGTGGTATTCTGGTCAAGGCACAAAAACATCTCCAACATGCCTCTGGCAATAT^ 
ATGAATTATTAAACCGTCGTACCATAGCTATCGAGCGGACGCTCCGTCACATTGAGTTGTCAGAAGGTGAGCCTGC 
GCnTGATCTACTCCATTTTCAAGAAAATGAGGAAGAATATGAAGATTAG 

4164.3 

25 ATGAAGATTAGTCACATGAAAAAAGATGAGTTATTTGAAGGCTTTTA 

ACTCGAGCTGGGAAAAACTACCTAGCCTTTACCTTCCAAGATGATAGTG 

GCCCAACCTCATAACATTGAGGCCnTTACCGCAGGTAAGGTTGTCCACATGAAAGGACGCCGAGAAGTTTATAAC 

AATACCCCTCAAGTCAATCAAATTACTCTCCGCCTGCCTCAAGCTGGTGAACCCAATGACCCAGCTGATT^ 

TCAAGTCACCAGTTGATGTCAAGGAAATTCGTGACTACATGTCGCAAATGATTTTCAAAATTGAAAATCCTGT 

30 GCAACGGATTGTCCGAAATCTCTACACCAAGTATGATAAGGAATTCTACTCCTATCCAGCTGCCAAGACCAACCA 
CCATGCCTTTGAAACGGGCTTGGCCTATCATACGGCGACCATGGTGCGTTTGGCAGACGCTATTAGCGAAGTTTAT 
CCTCAGCTCAATAAGAGCCTGCTCTATGCGGGGATTATGTTGCATGACTTAGCTAAGGTCATCGAGTTGACGGGGC 
CAGACCAGACAGAGTACACAGTGCGAGGTAATCTTCTTGGACATATCGCTCTCATTGATAGCGAAATTACCAAGA 
CAGTTATGGAACTCGGCATCGATGATACCAAGGAAGAAGTCGTTTTGCTTCGTCATGTCATCCTCAGTCACCACGG 

35 CITGCTTGAGTATGGAAGCCCAGTCCGTCCACGCATTATGGAAGCAGAGATTATCCATATGATTGACAATCTGGAT 
GCAAGCATGATGATGATGTCAACAGCTCTTGCTTTGGTGGATAAAGGAGAGATGACCAATAAAATCTTCGCTATC 
GATAATCGTTCCTTCTATAAACCAGATTTAGATTAA 



40 ATGAGTGAAAAAGCTAAAAAAGGGTTTAAGATGCCTTCATCTTACACCGTATTATTGATAATCATTGCT 
CAGTGCTAACTTGGTTTATCCCTGCGGGGGCCTTTATAGAAGG^^ 

GATTTGGGATGTCCTCATGGCACCGATTCGGGCTATGCTAGGTACTCATCCAGAGGAAGGTTCGCTCATTAAAGA 
ACGAGCGCAGCGATTGATGTAGCCTTCTTCATCCTTATGGTTGGTC 

CTCTTGACGTAGGGATTGCCTCTATCGTGAAGAAGTATAAGGGCCGCGAAAAAATGTTAATTTTGGTACTGATGC 
45 TTTGTTTGCCCTCGGTGGTACAACTTATGGTATGGGTGAAGAAACAATGGCCTTCTATCCACTCCTTGTG 

ATGATGGCCGTTGGTTTTGATAGCCTGACTGGTGTTGCAATTATTTTGCTCGGTTCTCAAATCGGCT 

TACTCTGAATCCATTTGCGACAGGTATTGCTTCAGCGACTGCGGGAGTTGGTACAGGGGACGGTATCGTACTTCG 

CTGATCTTCTGGGTTACCTTGACrGCTCTTAGTACTTGGTTTGTTTACCG^ 

GACTAAGTCACTGGTTTATAGTACTCGCAAAGAAGATTTGAAACACITTAACGTAG^ 
50 TCTACACTTAGCAGCAAACAAAAATCAGTTCTCITCrTATTTGTGTT 

TCCATGGACAGACCTTGGCGTTACCATTTTTGATGACnTTAATACnTGGTTG 

TTGTCGGTTCATCTACTTCTGCACTAGGTACTTGGTAC^ 

CTGATTGGTGTTATTTATGGTCTTAAAGAAGATAAGATTAT^ 

TGTTGCCTTGATCGTAGCGATTGCTCGTGGTATTCAAGTTATCATGAACGACGGTATGATTACCGATACAATCCTC 
55 AACTGGGGTAAAGAAGGCTTGAGCGGTCTATCTTCACAAGTCTTTATC 

GTCATTCTTGATCCCATCTTCATCTGGTCTTGCCAGCGCAACTATGGGTATCATGGC 
TCCACnTGGAGAATTTGTAAATGTCCGTCCTAGCTTGATTATCACTGCTTACCAATCTGCT^ 

TGATTGCACCAACATCTGGTATTGTGATGGGAGCTCITGCACTTGGACGTATCAACATTGGTACTTGGTGGAAA^ 
CATGGGCAAACTCGTAGTCGCTATTATTGTAGTGACCATCGCCCTTCITCTCCTTGGAACCTTCCTTCCATT 
60 AA 



4166.3 

ATGAAAATAGATATAACAAATCAAGTTAAAGATGAATTTCTTATATCA 
TACTCAATGAAGGAGAAAATGGAACACCTTTTGGACAAGCAATCCAAGATGTCCT 
65 GTCGAGACATAGGTTTCACTACCTATCTTGACCCTAAAGGTTATTACGGATATGCAGAAATCGGTCAGGGAGCAG 



SUBSTITUTE SHEET (RULE 26) 



WO 00/06737 



52 



PCT/GB99/02451 



AGCTTCTGGCCATTCTCTGTCATTTGGATGTTGTTCCAT 

AGCAACTATCAAAGACGGCTGGGTATTCGGACGTGGTGTCCAAGATGATAAAGGCCCnTCGCTCGCAGCTCTCTA 

TGCAGTAAAAAGCnTGCTGGACCAAGGTATTCAGTTCAAAAAGCGCGTACGCTTTATCTTTGGTACCGATGAGGA 

AACCCTCTGGCGCTGCATGGCACGCTACAATACCATCGAAGAACAGGCCAGTATGGGCriTGCACCTGACTCATC 

TTTTCCTCTGACCTATGCTGAAAAAGGGCTTCTACAGGTCAAACTTCATGGCCCT 

GAAGTAGGAGGCGCCITTAACGTTGTACCAGACAAGGCCAACTACCAAGGTCT^ 

GGTCTCAAAGAAGCTGGTTATGATTACCAAACCACTGAACAAACCGTAACGGTTCTCGGAGTGCCAAAGCATGCT 

AAGGATGCTAGTCAAGGTATCAATGCTGTCATCCGACTAGCTACCATTCITGCTCCTCTCCAAGAACACCCT 

TCAG'lU'i'rC l l'GCAACACAAGCAGGTCAAGACGGCACAGGAAGACAAATCTTTGGTGATATAGCAGATGAACCnT 

CTGGTCACCTATCCTTTAATGTCGCAGGTCTCATGATCAATCATGAACGTTCTGAAATCCGTATTGACATTCGGAC 

TCCTGTCTTAGCTGACAAGGAAGAACTAGTAGAGTTGCTTACAAGATGTGCACAAAACTACCAACTCCGCT 

AGAGTTTGACTATCTAGCGCCTCTATACGTCGCAGAAGACAGTAAACTCGTTAGCACACTGATGCAAATCTA 

AGAAAAGACTGGCGATAACAGTCCTGCTATTTCATCCGGTGGTGCCACTTTTGCTCGCACCATC 

GCCTTCGGCGCCTTATTCCCAGGAGCGAAGCAGACAGAACATCAGGCAAATGAATGTGCCGTTCTAGAAGATTTG 

T AC CGTG CT ATGG AT ATTT ATG C CG A AG C CGTCT ATCG ACTTG C AA CTT AA 

4169.1 

ATGTCTAATTCATTTGTCAAGTTGTTAGTCTCTCAATTATTTGCAAATTTAGCAGATA'r 
CATTGCTAACATATACATTATTTCAAAATCAGTAATTC 

TTGTTGCGAGTCTTTTAGTTCCGTTGGTTACTAAAAGGTTAGCGCTAAATAGGGTTTT 

AAGACTATATTATTGGCGATACTGGTAGGAATGTTTACCGTAATGC 

TTGTTGTTGCAATTTCCATACTAGATGGTTTTGCAGCACCCGTTTCCTATGCTATTGTGC 

TTGGGTAAGGCTAATTCAGCCITATCAATGACTGGTGAAGCTGTTCAATTGATAGGTTGGGGATT 

TGTTTGCAACAATTGGTCTGTTACCTACCACGTGTATCAATTTAGTCTTGTATATCATTTCT 

TTTCTTCCTAACGCTGAAGTGGAGGTGTTAGAGTCAGAAACT 

TTGCTAGAAATCCTAGATTAAGACTTTTTGTATCAGCAAATTTATTGGAAATTTTTT 

TCCATTATACTTGTTTTTGTAACGGAGTTATTAAATAAAACGGAAAGTTACTGGGGATATTCT 

CTATTGGTATTATAATTAGTGGCTTAATTGCITTTAGGCTATCTGAAAAGT^ 

ATTATTCACCCCAAATCTAAAAACCATCCAGAATCCTTGCCTT^ 

ATGGGTGTTTTTTACTAGACAAAAAAGAGTTTCCCCITTATGGTATA^ 

AAACTCACATGAACAGTTTACCAAATCATCACTTCCAAAACAAGTCT^ 

TTTAACCCAGTATGGTGGTCTTATCrniaTCAGGAACri-ll-l^CCCAGTTGAAACTAAAAGAG 

ATTTAGTAACGAATGACCAACGCCGCTACTGTCGTTATTCGGATTCAGATATCCTTGTCCAGTTCCTCTTTCAA 

TTAACAGGTTATGGAACGGACTATGCTTGTAAAGAATTGTCAGCTGATGCCTACTT 

TCCAAAATTGTTGGAAGGAGGGCAGCITGCTTCACAGCCAACCT^^ 

ACAGTCCATAGTTTGCGATGCCTCAACCTTGAATTGGTCGAATTCTITrTACAGTTTC^ 

TTGTAGATATCGATTCTACCCATTTCACAACTTATGGCAAGCAAGAAGGTGTTGCTTATAACGCCCACTATCGTGC 

TCATGGCTATCATCCTCTTTATGCTTTCGAGGGGAAGACAGGTTATTGTTTCAATGCCCAGCIT CG 

GTTATTGTTCTGAAGAGGCAGACAGCTTTATCACACCTGTTTTAGAACGGTTTAATCA 

TAGTGGCTTTGCGACCCCAAAATTATACGATTTAATTGAAAAAACAGGGCAATACTACCT 

AAATACTGTTCTGAGCCGTCTTGGAGACCTTTCCCTCCCTTGC^^ 

TCCGCCTACTCAGAAACTCTCTATCAAGCAGGATCTTGGTCGCACAAGCGTCGTGTCTGCCAGTTCTCTGAACGAA 
AAGAAGGAAACTTGTTCTACGATGTTATTTCTCTCGTTACAAATATGACGAGTGGAACAAGCCAAGACCAGTTTCA 
GCTTTATCGTGGACGTGGTCAAGCCGAGAATTTCATCAAGGAGATGAAGGAG 
TAGTTCAACCTTAATCAAAAACGAAGTTCGTATGATGATGAGCTGTATCGCC 

CATCTAGCTGGAGGTGACTTCCAAACITTAACAATCAAACGCTTCCGCCATCri'rrrCl 1 CACGTGGTGGGAAAAT 

GTGTTCGAACAGGACGCAAGCAGCTCCTCAAATTGTCTAGTCTCTATGCCTAT^ 

TTCTAGGATTAGAAAAGTCAACCTGAATCITCCTGTTCCT 

CATTAA 

4169.3 

ATGATGGAGTTTTTTCAACAGCTTCCTCATTTAGAGCCATATC^ 
AACCITGCCCATCTTTATAGGTCTCTTTTTCAAGAAACGCTTTGCCTGGTA 

TTGTCACCATGTTGGTGGGTGGAAAGACCAATCAACTAGCTGCCTTGGGTATTTACCTTTGCTGGGAAATATTGCT 
CCTGCTTTTCTACAAGCATTATCGAAAAAGCAAGGATGGCAAG 

CTTCCGATTATCITTGTCAAGGTGCAACCAGCTATCAATGGAACGCAGTCTTTGCTTGG^ 

CCTGACCTTTCGTTCGGTTGGAATTGTCATCGAG(HT5AGAGATC 

CTCCGTTTCCTTCTCITCATGCCAACTTTCTCGAGTGGTCCAATCGATCGCTTTAA 

GGCTATTCCTGAGCGAGATGAGTTGATGGATATGCTGGATGAATCTGTCCGCTATATCATGTGGGGCTTTTTGTAT 
AAGTTTATCCTAGCTCATGTTTTAGGAGAGACCnTACT^ 
TCTTTAATCTCTATGCCITGGCAGTTATGTATACTTTTGGTCTGGAACTC^ 
TTGCTTTGGCCATCTCAAACTTGATGGGAATCCGTAGCCCT^ 

AAAGGAGTTTTGGAATCGCTGGCATATGAGTCTGTCCrrTCTGGTTCCGTGACTTTGTCTTTATGCGAATGGTGATG 

GTGTTAACCAGAAAGAAAGTCTTTAAAAATCGTAATGTAACCTCAAGCATGGCCTACATTGTAAATATGCTGATTA 

TGGGATTTTGGCATGGTGTGACCTGGTACTATATCGCCTATGGACTCTTTCATGGACTAGGCTTGGTCATCAATGA 
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TGCCTGGGTTCGCAAGAAAAAAACGCTCAATAAGGAACGGAAAAAAGCAGGGAAGGCTGCCCTACCTGAGAATC 

GCTGGATTCAGTTGCTTGGCATGGTTGTCACTTTCCA 

AT AATCT ATG GTTT AAAAAAT AA 

4169.4 

ATGCTTAAACGCTTATGGATGATCTTCGGACCGGTCITGATCGCTGGTTTGTTGGTTTTTCT 

CCTACTGAGATGCATCAT AATCT AGGAGCTGAAAAGCGTTCAGCAGTGGCTACTACTATCGATAGTTTTAAGGAGC 

GAAGTCAAAAAGTCAGAGCACTATCTGATCCAAATGTGCGTTTTGTTCCCTTCTTTGGCTCTAGTC 

TTTTGACGGTGCTCATCCTGCGGTATTAGCTGAGAAATACAATCGTTCCTACCGTCCrTATCTT^ 

GGAGCTGCATCGCTTAACCAATATTTTGGAATGCAACAGATGTTACCACAGCTGGAGAATAAACAAGTT 

GTTATCTCACCTCAGTGGTTCAGTAAAAATGGCTATGATCCAGCAGCCTTCCAGCAGTATTTTAATGGAGACCAGT 

TGACTAGTTTTCTGAAACATCAATCTGGGGATCAGGCTAGTCAATATGCAG 

AAACGTAGCTATGAAGGACCTGGTTCAGAAGTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGAT 

TGAATTATTGGCTCGTTTTAATGAACGCCAAGCTTCCTTTTTTGGTCAGTTTTCGGTTAGA 

ATAAGCATGTAGCTAAGTATTTAAAAATCTTGCCAGACCAGTTTTCTTATCAGGCAATAGAAGATGT^ 

AGATGCTGAAAAAAATACITCCAATAATGAGATGGGAATGGAj^AATTATTTCT 

TTTGAAGAAATTAAAGGATTCTCAGAAAAGCTTTACCTATCTCAAGTCGCC^ 

TTAACACAGTTTTCTAAATCTAAGGTAAACCCGATTTTTATCATTCCACCTGTT^ 

CTGGTCTACGAGAGGATATGTACCAACAAACGGTGCAGAAGATTCGCTACCAGTTAGAAAGTCAAGGTTTTACCA 
ATATAGCAGArrTTTCTAAGGACGGCGGGGAGCCTTTCITTATGAAGGACACCATTCACCTT 
GTTGGCTTTTGACAAGGCAGTTGATCCITTCCTATCCAATCCCACACCAGCTCCGACITACCATCT 
TTTTTCAGCAAAGATTGGGCGACTTATGATGGAGATGTCAAAGAATTTCAATAG 

4169.6 

ATGGAGAAAAACCTCAAGGCTTTGAAACAAACAACAGACCAAGAAGGCCCAGCAATTGAACCTGAAAAGGCAGA 
GGATACCAAGACAGTCCAAAATGGTTACTTCGAGGATGCAGCTGTCAAGGACCGCACCTTGAGTGACTATGCAGG 
TAACTGGCAATCAGTTTATCCTTTCCTTGAAGACGGCACGTTTGACCAAGTCTTTGACT 

GGTAAGATGACCCAGGCTGAGTACAAGGCTTACTATACAuAAAGGCTATCATACAGATGTGACTAAGATTAACATT 
ACTGATAATACTATGGAATTTGTTCAAGGTGGACAAAGCAAGAAATACACTTACAAGTATGTCGGTAAGAAAATT 
TTGACTTACAAGAAAGGCAATCGTGGCGTGCGTTTCCTCITTGAAGCCACAGATC 
ATGTTCAGTTTAGTGACCACAATGTTGCCCCAGTTAAGGCAGAACATTTCCATATCnTCnTT 

AGAAGCCCTCTTrGAAGAAATGGACAACTGGCCAACCTACTACCCAGATAACCTATCTGGCCAAGAAATCGCCCA 
AGAAATGTTGGCGCATTGA 

4170.3 

ATGAAAGATGGTCATTTGCTAGCCCATCATATTCGTTTGTTGAATGGGCGGATTTTTCAAAAGTTACT 

ATCCTGAGGCTCTTTATAGGGGTGAACAGGGCAAGATTTTAGCGGTTTTATGGAATAGTGAAACT 

CTGCGACAGATATCGCGCTTGCGACTGGACTTGCGAATAATACGCTGACGACTATGATAAAAAAGCTAGAGGAAC 

AAAAGCTTGTAATTGTTAGTCCGTGTGGAAAAGACAAGCGTAAGAAGTATTTAGTTTTAACGGAG 

CCCAGAAAGAAGTGGGGCATCGTGTCAGTCAGAAATTGGATACTATCTTTTACAAAGGATTTTCAGAGGAAGAAA 

TTCACCAATTTGAAGGTTTTCAAGAAAGAATTTTGGCGAATCTGAAAGAGAAGGGAAATG 

4170.4 

ATGACTAATTTAATTGCAACTTTTCAGGATCGTTTTAGTGATTGGTTGAC^ 
GCTTTTGACCTTGTTACTAGCTATTTTGCTTG 

ACTGGGTCITGCAGATTGCAGGTATTTTCCAGACCATCCCGTCTCTGGCCITGTTGGGGCTCTT^ 
GGAATTGGGACCTTGCCGGCTTTGACAGCTCTAGTGATTTATGCGATTT^ 

GGCTGAAGGGAATTGATCCGAACCTGCAAGAGGCTGGGATTGCCTTTGGGATGACCAGATGGGAACGTCTCAAGA 

AATTTGAAATTCCACTCGCCATGCCTGTTATCATGTCTGGGATTCGGACGGCAGCTGTTTTGATTATCGGTACGGC 

AACCTTGGCGGCCnTGATTGGTGCAGGGGGACTAGGTTCCTTTATTCTTTTGGGAATTGACCGTAATAATGCCAGT 

TTGATTTTGATTGGGGCACTTTCnTCTGCAGTGCTAGCCATTGCCTTTAACTTCCT 

CAAAATTACGGACGATTTTCTCAGGTTTTGCCTTGGTGGCTTTATTACTGGGTCTGTCTTAT 

GTTCAAAAAGAGAAGGAAAACTTGGTTATTGCTGGGAAAATAGGTCCAGAACCAGAAATTT^ 

AAGTTGCTGATTGAAGAAAATACCAGCATGACTGCGACTGTTAAACCGAATT^ 

AAGCTCTGAAAAAAGGCGATATTGACATCTATCCTGAATTTACTGGTACGGTC 

ACCCAAGGTGAGTCATGAACCAGAACAGGTTTATCAGGTGGCGCGTGATGGCATTGCTAAGCAGGATCATCTAGC 

CTATCTCAAACCCATGTCTTATCAAAACACCTATGCTGTAGCTGTTCCGAAAAAGATTGCTCAAGAATATGGCT^ 

AAGACCATTTCAGACTTGAAAAAAGTGGAAGGGCAGTTGAAGGCAGGTTTTACACTCGAGTTTAACGACCGTGAA 

GATGGAAATAAGGGCTTGCAATCAATGTATGGTCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCTTCGCTATC 

AGGCTATTCAGTCAGGGGATATTCAAATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACA 

GGTCTTGGAAGATGACAAGCAACTCTTCCCACCTTATCAAGGGGCTCCACTCATGAAAGAAGCTCTTCTCAAGAA 

ACACCCAGAGTTGGAAAGAGTTCTTAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTA 

CCAAGTCGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTTTGTTGAAGAA 

ATGA 
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4170.5 

ATGATGCATACTTATTTGCAAAAGAAAATTGAAAATATCAAAACAACCCT 
CGTATGGTTGCGGCTATGGCTGATTTAGGATTTTCAGGAACTATGAAGGCTA 
GTAGTTTTGCCCAGTGGATTTATTTGCTGGTTTTAGGAAGTTTTCCTCT 
5 ATTGTTGACTGGATTGGGATGATTTGTAGCTTGACAGGGATTATCTGTGTAATCTTTGTATCGGAAGGTCGAGCAA 

GTAATTATCTTTTTGGCTTGATTAACTCTGTT 

CTGACGACACTTTACTTCACAGTCATGCAGCCAATTGGACrrCTAGTTTGGATTTATCAGGCAC 
AAAAGCAGGAGTTTGTCGCGCGTAAACTGGACGGCAAGGGCTGGACAAAGTATCTTTCCATTAGTGTGCTTTGGT 
GGTTGGCCTTTGGCTTCATTTATCAGTCTATTGGTGCCAATCGTCCCTATCGTGATTCAATCACAGATGCAACCAA 
10 TGGGGTAGGGCAAATCCTCATGACAGCTGTTTACCGTGAACAGTGGATATTCTGGGCGGCTACCAATGT(nTTTCA 
ATCTATCTCTGGTGGGGAGAAAGCCTGCAAATTCAAGGGAAATATCTAATTTATCTCATTAACAGTCT 
GGTATCAATGGAGCAAGGCAGCTAAGCAGAATACTGATTTACTTAACTAG 

4170.6 

15 ATGAGAAATATGAAGGCAAAATATGCTGTTTGGGTGGCTTTTTTCnTA^ 
TGCAGGTGGAGTATTTGGTTCTAGCGCTGTTCTTGCTGACTCT^ 

ATATCAGCITTTCTAGAAACAATCTCCAATCGTGAAGAAGACAATCAGTACACCTTGGGCT 

TGCTAGGAGCCTTGGTAACAGCTGTGATTCTCGTAACGGGCTCTGTTCTAGTCATTTTGGAAAATGTCACGAAGAT 
TTTGCATCCGCAACCAGTCAATGATGAGGGGATTCTCTGGTTAGGAATTATTGCGATTACTATCAATCTGTTAGCG 
20 AGTCTGGTGGTTGGTAAGGGAAAGACAAAGAATGAGTCTATTCTGAGTCTGCAT 
TGGGTAGCTGTTATCCTGATGGCGATTGTTCTTCGATTTACGGACTC 
CATTTCTTTCTTTATTCTTTCAAAAGCCCITCCACGTTTTTGGTCT 

AAGGTCTTGATATCAAGCAAGTAAAGAGTGGCCTGGAGCGATTGGACAATGTGGCCAGCCTTAATCAGCTTAATC 
TCTGGACTATGGATGCTTTGGAAAAAAATGCCATTGTCCATGTTTGTCTAAAAGAAATGGAACATATGGAAA 
25 TAAAGAGTCTATTCGAATTTTCCTAAAAGATTGTGGTTTTCAAAATATTACCAT^ 

ACTCACCAAACCCATAAGCGAAAGGTGTGTGACTTGGAACGGAGTTATGAGCATCAACATTAG 

4170.8 

ATGATTGAATACAAAAATGTAGCACTGCGCTACACAGAAAAGGATGTCTTGAGAGATGTCAACTTACAGATTGAG 
30 GATGGGGAATTTATGGTTTTAGTAGGGCCTTCTGGGTCAGGTAAGACGACCATGCTCAAGATGATTAACCGTCTTT 
TGGAACCAACTGATGGAAATATTTATATGGATGGGAAGCGCATCAAAGACTATGATGAGCGTGAACTTCGTCTTT 
CTACTGGTTATGTTTTACAGGCTATTGCTCITTTTCCAAATCT 

ATGAAGGGGTGGAGCAAGGAAGAAATTACGAAGAAAACAGAAGAGCTTTTGGCTAAGGTTGGTTTACCAGTAGCC 
GAGTATGGGCATCGCTTACCTAGTGAATTATCTGGTGGAGAACAGCAACGGGTCGGTATTGTCCGAGCTATGATTG 
35 GTCAGCCCAAGATTTTCCTCATGGATGAACCCTTTTCGGCCnTGGATC 
GACAAAAGAATTGCATAAAGAGTTTGGGATGACAACGATTTTTC 

GGCGGACCGTATTGCTGTCnTGCAGGATGGAGAAATTCGCCAGGTAGCGAATCCCGAGACAATTTTAAAAGCGCC 
TGCAACAGACTTTGTAGCAGACTTGTTTGGAGGTAGTGTTCATGACTAA 

40 4171.1 

ATGTCAGCAGTTGCTATTTCAGCTATGACCAAGGTTATGCAAGAAACCCACGGAAATCCITCTAGTATTCATG 
ATGGTCGTCAAGCTGGCAAACTCTTGCGAGAAGCCCGTCAGGAACTAGCCCAGTTACTAAGGACAAAACCTCAAC 
ATATCTTTTTCACTTCTGGTGGGACTGAAGGCAATAATACTACCATCATTGGCTACTGTCTTCGTCACCAAGAACA 
AGGAAAACATATCATCACAACTGCCATCGAGCACCATGCTGTCCTTGAAACAATTGATTACTTGGTTCAACACT^ 

45 GGGTTTGAAGCAACCATTATCCAGCCAGAAAATCAAGAAATCACAGCCCAGCAAATTCAAAAGGCTTTACGTGAC 
GATACGATTTTGGTTTCTACCATGTTTGTCAATAATGAGACAGGAAACCTACTGCCCATCGCTGAAATTGGCCAAA 
TACTCAAGCAACACCCTGCTGCCTATCATGTTGATGCAGTTCAGGCTATTGGTAAAATCCCAATTCATTCAGAAGA 
ATTGGGCATTGATTTTCTCACrGCTTCTGCCCACAAATTCCATGGTCCTAAGGGAATCGGTT^ 
GCATGGACTTTGATTCCTATCTACATGGCGGAGACCAGGAACAGAAAAAACGTGCAGGAACTGAAAATCTGCCTG 

50 CCATTGTAGGCATGGTTGCAGCCCTAAAAGAAGACCTAGAAAAACAAGAAGAACATTTTCAACATGTACAAAATC 
TAGAAACTGCCTTTCTGGCAGAGCTGGAGGGCATTCAGTATTACCTGAATAGAGGAAAACATCATCTCCCTTATGT 
TCTCAATATTGGATTTCCTGGTCAGAAAAATGACCTCTTACTC 

CTGGCTCAGCCTGTACTGCAGGCGTTGTCCAATCCAGCCATGTTCTTGAAGCCATGTATGGCGCAAATTCAGAACG 
(HTGAAGGAATCCCTTCGCATCAGTTTGTCGCCACAAAATACCGTTGAAGACCT^ 
55 AAAGAAATTATCGGAGGTTAG 



60 4172.1 

ATGTTATTCAAATTATCTAAGGAAAAAATAGAGCTAGGC™^ 
GTTTTGCCTTGGTCATTTTACTAGGCTCTCTTCTTTTGAGCTTGCCCTTTG 

TATTTTGATCATCTTTTCACTGCTGTCTCTGCAGTCTGTGTGACGGGTCTCTCAACCCTTCCAGTAGCTCACACCTA 
TAATATCTGGGGTCAAATAATCTGTTTGCTCTTGATTCAGATCGGTGGTCTAGGGCTCATGACCTTTATTGGGGTTT 
65 TCTATATCCAGAGCAAGCAAAAGCTTAGTCTTCGTAGCCGTGCAACTATTCAGGATAGTTTTAGTTATGGAGAAAC 
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TCGATCTTTGAGAAAGTTTGTCTATTCTATTTTTCT 
GTTTTCGCCTTATTCCTCAACTTGGCTGG 

AATGCCGGTTTTGATAATTTAGGGAGCACCAGTTTATTTGCTTTTCAGACC 
AGGCTTGATTATTACAGGCGGCCTTGGTTTTATGGTCTGGTTTGA 
5 GGACGTCTGCACTTTCATACGAAGCTTGTACTATTATTGACTATAGGTTTGTC 
CTTTCTTGAGTGGAACAATGCTGGAACGATTGGCAATCTCCCT^ 
ACAGTGACGATGCGAACAGCTGGCTTTTCTACGATAGATTATACTCAGGCTCATC 

TCTTACAGATGTTTCTAGGTGGGGCACCTGGAGGAACAGCTGGGGGACTCAAGATTACGACATTTTTTGTCCTCTT 
GGTCTTTGCACGAAGTGAGCTTCTAGGCTTGCCTCATGCCAATGTTGCGAGACGAACGATCGCGCCGCGAACGGTT 
10 CAAAAATCCITTAGTGTCTTTATTATCITTTTGATG 

AGGCAATCCTCCCTTTATCCACCTCGTATTTGAAACCAT^ 

ACTCCTGACCTTGGGAAATTGGCTCTCAGTGTTATCATGCCACTTATGTTTATGGG 

ACGAATTGGTCCCTTGACCTTGTTTGTTAGCTTGGCAGATTACCATCCAGAAAAGAAAGATATGATTCACTATATG 
AAAGCAGATATTAGTATTGGTTAA 

15 

4172.2 

ATGTCAGATCGTACGATTGGAATTTTGGGCTTGGGAATTTTTGGGAGCAGTGTCCTAGCTGCCCTAGCCAAGCAGG 
ATATGAATATTATCGCTATTGATGACCACGCAGAGCGCATCAATCAGTTTGAGCCAGTTTTGGCGCGTGGAGTGAT 
TGGTGACATCACAGATGAAGAATTATTGAGATCAGCAGGGATTGATACCTGCGATACCGTTGTAGTCGCGACAGG 

20 TGAAAATCTGGAGTCGAGTGTGCTTGCGGTTATGCACTGTAAGAGTTTGGGGGTACCGACTGTTATTGCTAAGGTC 
AAAAGTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGAGTATGAAATGGGG 
CAGTCTCTAGCACAGACCATTCTTTTCCATAATAGTGTTGATGTCTTTCAGTTGGATAAAAATGTGTCTATCGTGG 
AGATGAAAATTCCTCAGTCITGGGCAGGTCAAAGTCTGAGTAAATTAGACCTCCGTGGCAAATACAATCTGAATA 
TTTTGGGTTTCCGAGAGCAGGAAAATTCCCCATTGGATGTTGAATTTGGACCAGATGACCTCTTGAAAGCAGATAC 

25 CTATATTTTGGCAGTCATCAACAACCAGTATTTGGATACCCTAGTAGCATTGAATTCGTAA 

4172.3 

ATGAAGTTATTGTCTATCGCAATTTCTAGCTATAATGCAGCAGCCTATCTTCATTACTGTGTGGAGTCGCTAGTGA 
TTGGTGGTGAGCAAGTTGGGATTTTGATTATCAATGACGGGTCTCAGGATCAGACTCAGGAAATCGCTGAGTGTTT 
30 AGCTAGCAAGTATCCTAATATCGTTAGAGCCATCTATCAGGAAAATAAATGCCATGGCGGTGCGGTCAATCGTGG 
(HTGGTAGAGGCnTCTGGGCGCTATTTTAAAGTAGTTGACAGTGATGACT 
ATTCTTGAAACCTTGCAGGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTCTTTGTGAC 

AGGAAGGGCAGTCTCGTAAGAAGAGTATGAGTTACGATTCAGTCTTGCCTGTTCGGCAGATTTTTGGCTGGGACCA 
GGTCGGAAATTTCTCCAAAGGCCAGTATACCATGATGCACTCGCTGATTTATCGGACAGATTTGTTGCGTGCTAGC 
35 CAGTTCTAA 

4172.4 

ATGAAATTCAATCCAAATCAAAGATATACTCGTTGGTCTATTCGCCGTCTCAGTGTCGGTGTTGCCTCAGTTGTTG 
TGGCTAGTGGCTTCTTTGTCCTAGTTGGTCAGCCAAGTTCTGTACGTGCCGATGGGCTCAATCCAACCCCAGGTCA 
40 AGTCTTACCTGAAGAGACATCGGGAACGAAAGAGGGTGACTTATCAGAAAAACCAGGAGACACCGTTCTCACTCA 
AGCGAAACCTGAGGGCGTTACTGGAAATACGAATTCACTTCCGACACCTACAGAAAGAACTGAAGTGAGCGAGGA 
AACAAGCCCITCTAGTCTGGATACACTTTTTGAAAAAGATC 

CTTAAAAGAAACTGTAGATACAGCTGATGTGGATGGGACACAAGCAAGTCCAGCAGAAACTACTCCTGAACAAGT 
AAAAGGTGGAGTGAAAGAAAATACAAAAGACAGCATCGATGTTCCTGCTGCTTATCTTGAAAAAGCTGAAGGGAA 
45 AGGTCCTTTCACTGCCGGTGTAAACCAAGTAATTCCTTATGAACTATTCGCTGGTGATGGTATGTTAACTCG 
TTACTAAAAGCTTCGGATAATGCTCCTTGGTCTGACAATGGTACTGCTAA^^ 

GATTAACAAAAGGGAAATACTTCTATGAAGTAGACTTAAATGGCAATACTGTTGGTAAACAAGGTCAAGCTTT^ 

TTGATCAACTTCGCGCTAATGGTACTCAAACTTATAAAGCTACTGTTAAAGTTTACGGAAATAAAGACGGTAAAGC 

TGACTTGACTAATCTAGTTGCTACTAAAAATGTAGACATCAACATCAATGGATTAGTTGCTAAAGAAACAGTTCAA 

50 AAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTGAAGGT 
CCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGCATGTTGACTCGTCTCTTGC 
TCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGCGACGCTAAAAACCCAGCCCTATCTCCACTAGGCGAAA 
ACGTGAAGACCAAAGGTCAATACTTCTATCAAGTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACAAGCGC 
TCATTGACCAGTTCCGAGCAAATGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGACGGTA 

55 AACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCTAAAGAAACAG 
TTCAAAAAGCCGTTGCAGACAACGTTAAAGACAGTATCGATGTTCCAGCAGCCTACCTAGAAAAAGCCAAGGGTG 
AAGGTCCATTCACAGCAGGTGTCAACCATGTGATTCCATACGAACTCTTCGCAGGTGATGGTATGTTGACTCGTCT 
CTTGCTCAAGGCATCTGACAAGGCACCATGGTCAGATAACGGTGACGCTAAAAACCCAGCCCTATCTCCACTAGG 
TGAAAACGTGAAGACCAAAGGTCAATACTTCTATCAATTAGCCTTGGACGGAAATGTAGCTGGCAAAGAAAAACA 

60 AGCGCTCATTGACCAGTTCCGAGCAAACGGTACTCAAACTTACAGCGCTACAGTCAATGTCTATGGTAACAAAGA 
CGGTAAACCAGACTTGGACAACATCGTAGCAACTAAAAAAGTCACTATTAACATAAACGGTTTAATTTCTAAAGA 
AACAGTTCAAAAAGCCGTTGCAGACAACGTTAAGACAGTATCGATGTTCCAGCAGCCTACCTAG 
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ATGAAACTAAAAAGTTATATTTTGGTTG 

TCAAAAAATGCTGATTGCGAAAGGCGAGATTTACTTTTTC 

GGGATTAGTCTCTTTCTCCTATTGCCAGTCTTTACGTC^ 

CCAAGGATTTTCCTTCAAATTTGGAGGTTCAAGGTCCTGTAGAATTTCAGCAATT 
5 GTCCCATGATTTGCAGGTAAGCTTTGATTCCTTGGAAGAAAGCGAAC 

GTCGCATGATATTAAGACTCCTATCACTTCGATCCAAGCGACGGTAGAAGGGATTTTGGATGGGATTATCAAGGA 
GTCGGAGCAAGCTCATTATCTAGCAACCATTGGACGCCAGACGGAGAGGCTCAATAAACTGGTTGAGGAGTTGAA 
TTTTTTGACCCTAAACACAGCTAGAAATCAGGTGGAAACTACCAGTAAAGACAGTATTTTTCT 
ATTGAGTGCATGAGTGAATTTCAGTTTTTGATTGAGCAGGAGAGAAGAGATGTCCACTTGCAGGTAATCCCAGAGT 
10 CTGCCCGGATTGAGGGAGATTATGCTAAGCTTTCTCGTATCTTGGTC 

TGCTCCAGGAACCAAGCTGGAAGTGGTGGCTAAGCTGGAGAAGGACCAGCTTTCAATCAGTGTGACCGATGAAGG 
GCAGGGTATTGCCCCAGAGGATTTGGAAAATATTTTCAAACGCCTTTATCGTGTCGAAACTTCGCGTAACATGAAG 
ACAGGTGGTCATGGATTAGGACITGCGATTGCGCGTGAATTGGCCCATCAATTGGGTGGGGAAATCACAGTCAGC 
AGCCAGTACGGTCTAGGAAGTACCTTTACCCTCGTTCTCAACCTCTCTGGTAGTGAAAATAAAGCCTAA 

15 

4172.6 

ATGTTTGGTCAAACGGCTCAACATGGTCTTACGAATAGCCTGA 

CACAATTGGCGTTTTTTTGCCAGATGCTCCGCTGTTCCAGATCGGTTGAGCAGGGTACTGGAAATCACCGTCGTGA 
GTTCAATATGATTCAGCAGATATTCTCGCATTTTGGGATGACTCACITGGGACAAATCAAGTTGG 
20 TCGATTGACCTTGAGTTGCTGGTCAATGCACTTAATCATCACTTGCTCATTGACAGACT 

AAATAACGATAGAAATCGACAGGCAGATAGTACATGGTCTTGACCTGCTGAAGGGGCGTAAAGACAAAGAGATTA 

TCGACATAAAAAGTATGTTCAGGCAGTTAGAACTGGCTAGCACGCAACAAATCTGTCCGATAAATCAGCGAGTGC 

ATCATGGTATACTGGCCTTTGGAGAAATTTCCGACCTGGTCCCAGCCAAAAATCTGCCGAACAGGCAAGACTGA 

25 4174.1 

ATG G AAC ATTT AG C AACTT ATTTTTC A A C CT ATG G AG G AG CTTTCTTCG CTG C ATTG GG AATTGT ATTG G CG GTTG 
GATTAAGCGGTATGGGGTCTGCTTATGGAGTTGGTAAGGCTGGGCAATCTGCCGCAGCTTTACTGAAAGAACAGC 
CTGAAAAGTTTGCCTCAGCTTTGATATTGCAATTATT^ 

TTTAATTTGGTTGCAATTAACTCCAGAACITCCTTTAGAAAAAGGCGTTGC^ 
30 CT ATTGTAGGATACTTTTCAGCTAAGCATCAAGGAAATGTAGCAGTAGCGGGAATGCAAATCTTGGCTAAAAGAC 

CAAAAGAATTCATGAAGGGAGCAATTTTAGCTGCCATGGTAGAAACCT 
CATTTTGACCCTTCGTGTATAA 
4175.2 

ATGTTAAAATCAGAAAAACAATCACGTTATCAAATGTTAAATGAAGAATTGTCCTTCCTATTGGAAGGCGAAACC 

35 AATGTTTTGGCTAATCTTTCCAACGCCAGTGCTCTCATAAAA^ 

TTTGTTCGATGGAAAGGAATTGGTTTTAGGCCCCTTCCAAGGAGGTGTTTCCTGCATCCGTATTGCACTAGGCAAG 
GGTGTTTGTGGTGAGGCAGCTCACTTTCAGGAAACTGTTATTGTTGGAGATGTGACGACCTATCTCAACT 
CTTGTGATAGTCTAGCTAAAAGTGAAATTGTGGTGCCGATGATGAAGAATGGTCAGTTACTTGGAGTTCTGGATCT 
GGATTCTTCAGAGATTGAGGATTACGATGCTATGGATCGAGATTATTTGGAACAATTTGTCGCr^ 

40 AAGACAGCATGGGACTTTACGATGTTTGAGGAAAAATCTTAA 

4175.3 

ATGTCAGTATTAGAGATCAAAGATCTTCACGTTGAGATTGAAGGAAAAGAAATTTTAAAAGGGGTTAACCTGACC 
CTGAAAACAGGAGAAATTGCCGCTATCATGGGACCAAATGGTACAGGTAAATCGACTCTTTCTGCCGCTATCATG 

45 GGAAATCCAAACTATGAAGTAACTAAAGGTGAAGTTTTGTTTGATGGCGTAAACATCCTTGAGTTGGAAGTGGAT 
GAGCGTGCGCGTATGGGACTTTTCCTTGCTATGCAATACCCATCAGAAATCCCTGGAATTACCAATGCTGAGTTTC 
TTCGTGCCGCTATGAATGCGGGTAAAGAAGATGATGAGAAGATTTCAGTTCGTGAGTTTATTACTAAGCTAGATGA 
AAAAATGGAATTGCTCAACATGAAAGAAGAAATGGCAGAGCGTTACCTCAACGAAGGCTTCTCTGGTGGTGAGAA 
AAAACGCAATGAAATTCTTCAACITTTGATGTTGGAGCCAACATTTGCTCTTTTGGACGAGAT^ 

50 GATATTGACGCTCTTAAAGTTGTGTCTAAAGGTGTCAATGCCATGCGTGGTGAAGGTTTTGGTGCTATGATCATCA 
CTCACTACCAACGTCTTTTGAACTATATCACACCTGATGTGGTACACGTGATGATGGAAGGTCGTGTTGTCCTTTC 
TGGTGGTCCAGAATTGGCTGCGCGTTTGGAACGTGAAGGATACGCAAAATTAGCTGAAGAACTTGGCTACGACTA 
CAAGGAAGAATTGTAA 

55 4174.4 

ATGCCCTACAAAAGACAAAGGAGTTTTTCAATGGCACTTTCTAAACTAGATAGCCTTTATATGGCAGTGGTAGC 
ACCATTCGAAAAATCCACATCACCAAGGGAAGTTAGAAGATGCTGAGCAAATCAGTCTCAACAATCCGACTTGTG 
GGGATGTCATCAACCTCTCTGTCAAGTTTGATGCAGAGGACCGTTTGGAAGATATTGCTTTTCTAAATTCAGGATG 
CACGATTTCAACTGCTTCTGCTAGTATGATGACAGATGCCGTTTTAGGA 
60 GCGACTATTTTTTCTGAAATGGTTCAAGGGCAAAAAGATGAGCGTCAAGACCAACTTGGAGACGCGGCATTCTTG 
TCAGGTGTTGCCAAATTCCCTCAAAGAATCAAGTGTGCAACCCTAGCTTGGAATGCCCTTAAGAAAACAATTGAA 
AATCAAGAAAAACAGTAA 
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ATGAAAATTCAAGACCTATTGAGAAAAGATGTCATGTTGCTAGATTTGCAGGCAACTGAAAAAACAGCTGTCATC 
GACGAGATGATTAAAAATTTGACAGACCACGGTTATGTAACAGATTT^ 

CGTGAAGCTTTGACTTCTACTGGTTTGGGTGATGGAATCGCAATGCCTCACAGCAAAAACGCT 

GCGACAGTTCTATTTGCTAAGTCAAATAAGGGTGTTGACTACGAGAGCTTGGATGGACAAGCAACT 
5 TCATGATTGCAGCTCCAGAAGGTGCCAATGATACTCACITGGCAGCCTTGGCAGAATTGTCrCAATAC^ 

AGACGGTTTTGCAGACAAACTTCGTCAAGCAACATCTGCAGA^^ 

AAAACTGAGGAACITGTTCAAGCACCTGCTAATGACTCTGGTGACTTTATCGTAGCT 

GTATTGCCCACACnTACATGGCCCAAGAAGCCCTTCAAAAAGTA 

AAACCAACGGTGCTAGCGGTGTTGGAAATCAACTAACTGC^^ 
10 CAGCAGACAAGGCCGTTGAAATGGATCGATTTGATGGAAAACCATTGATCAATCGTCCAGTTGCTGACGGTATCC 

GTAAGACAGAAGAGCTAATTAACTTGGCTCTTTCAGGAGATACTGAAGTCTACCGTGCCGCTAATGGTGC 

CTGCAACAGCCTCTAACGAAAAACAAAGCCTTGGTGGTGCCTTGTACAAACACTTGATGAGTG 

TGTTACCATTCGTTATCGGTGGTGGTATCATGATTGCCCTTGCCT^ 

GAAAACCTTGGCAATCTTGGTTCTTACCATGAGTTAGCrrCTATGTTCATGAA^ 
15 GATGCTTCCAGTCTTTGCGGGTTATGTTGCCTACTCT 

GGTGCTATTGCCAAAGAAGGTTTTGCCTTTGGTAAAATTCCTTATGCCGCAGGTGGTGA^ 

CAGGTGTCTCATCHX3GTTTCCTAGGTGCCCTTGTTGGTGG 

TACGTTAAAGTTCCTCGTTCACTCGAAGGTGCTAAATCAATCCTTCTATTGCCACTTCTTGGAACA^ 
GATTTGTTATGCTAGCTGTGAATATCCCAATGGCTGCAATCAACA 
20 AGGAGGTTCAGCTGTCCTTCITGGTATCGTCCITGGTGGAATGATGGCTGTTGACATGGGTGGACCAGTTAATAAA 
GCAGCITATGTCITTGGTACAGGTACGCTTGCAGCAACTGTTTCTTCAGGTGGTTCT 
TGGCTGGAGGAATGGTGCCACCACnTGCAATCTTTGTCGCAACTCTTCTTTTCAAAGATAAA 

ACGTAACTCTGGTTTGACAAACATCATCATGGGCTTGTCATTTATCACTGAGGGAGCGATTCCATTTGGTGCCGCT 
GACCCAGCTCGTGCGATTCCAAGCTTCATCCTTGGTTCAGCAGTAGCAGGTGGACTCGTTGGTCTTACTGGTATCA 
25 AACTCATGGCGCCACACGGAGGAATCTTCGTTATCGCCCTTACTTCAAATGCTCTCCTTTAC 
GTAGGAGCAATCGTAAGTGGTGTGGTTTATGGTTACCTACGCAAACCACAAGCATAA 

4175.6 

ATGGCAAACAAGAATACAAGTACAACAAGACGGAGACCGTCTAAAGCAGAACTGGAAAGAAAAGAAGCGATTCA 
30 ACGAATGTTGATTTCGTTAGGAATTGCGATTTTATTGATTTTCGCAGCCTTCAAATTAGGGGCT 
CTTTATAATTTAATTCGCTTGCTAGTGGGTAGCCTAGCTTATCTGGCGATATTCGGCCT 
TTTCAAGTGGATACGAAAACAGGAAGGACTCrTATCTGGCTTTTTCACCATATTTGCT 

AGGCCTACnTGGTTTGGAAATATGGTTTGGACAAGTCCGTTCTAAAAGGGACCATGGCTCAGGTTGTGACAGATCT 
GACTGGTTTTCGAACGACTAGCTTTGCTGGAGGGGGCTTGATCGGGGTCGCT 

35 TTTTCAAATATCGGAACrrTACTTTATTGGTTCTATCTTGATTTTAGTGGGTTCT 

TACGATATTGCTGAATTTTTCAGTAGAGGCTTTGCCAAATGGTGGGAAGGGCACGAGCGTCGAAAAGAGGAACGC 
TTTGTCAAACAAGAAGAAAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTGAAAA 
AGCCTTACTCGATTTGCCTCCTGTTGATATGGAAACGGGTGAAATTCTGACAGAGGAAGCTGTTCAAAA 
CCTATTCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGGAA 

40 GATGACTCAGATGACGAAGATGTTCAGGTCGATTTTTCAGCCAAAGAAGCCCTTGAATACAAACTTCCAAGCT^ 
CAACTCTTTGCACCAGATAAACCAAAAGATCAGTCTAAAGAGAAGAAAATTGTC 

GAAGCAACCnTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGGGCCATCAGTGACCAAGTAT 
GAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATTTCCAATCTATCAGATGACCTCGCTCTAGCCITGGCT 
CCAAAGATGTCCGGATTGAAGCACCAATCCCTGGGAAATCCCTAATCGGAATTGAAGTGCCCAACTCCGATATTG 
45 CCACTGTATCTTTCCGAGAACTATGGGAACAATCGCAAACGAAAGCAGAA^ 

AGGCTGTTAATGGAACCGCAAGAGCTTTTGACCTTTCTAAAATGCCCCACTTGCTAGTTG 

AGGGAAGTCAGTAGCAGTTAACGGCATTATTGCTAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTAT 
GATGGTCGATCCCAAGATGGTTGAGTTATCTGTTTACAATGATATTCCCCACCTCTTGATTCCAGTCGTGACCAAT 
CCACGCAAAGCCAGCAAGGCTCTGCAAAAGGTTGTGGATGAAATGGAAAACCGTTATGAACTCnTTGCCAAGGTG 

50 GGAGTTCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAATTCCG 
CTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGAAGATGCTA 
TCATCCGTCTTGGGCAGAAGGCGCGTGCTGCAGGTATCCACATGATTCTTGCAACTCAGCGTCCATCTGTTGATGT 
CATCTCTGGTTTGATTAAGGCCAATGTTCCATCTCGTGTAGCATTTGCGGTTTCATCAGGAACAGACTCCCGTACG 
ATTTTGGATGAAAATGGAGCAGAAAAACITCTTGGTCGAGGAGACA 

55 CCAGTTCGTCTCCAAGGCTCCTTTATCTCGGATGACGATGTTGAGCGCATTGTGAACTTCATCAAGACTCAGGCAG 
ATGCAGACTACGATGAGAGTTTTGATCCAGGTGAGGTTTCTGAAAATGAAGGAGAATTTTCGGATGGAGATGCTG 
GTGGTGATCCGCTTTTTGAAGAAGCTAAGTCTTTGGTTATCGAAACACAGAAAGCCAGTGCGTCTATGATTCAGCG 
TCGTTTATCAGTTGGATTTAACCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCT 
GAAGGTACCAAACCTCGAAAAGTGTTACAACAATAA 

60 

4176.1 

ATGAGTTATTTTAAAAAATATAAATTCGATAAATCCCAG 
CTGTTTTTCTAGTTCTCTTGATTTTTGGCTTTTTTGGCT 

GCCTGAGGGAGAGTTTTGATGAGAGTGTTCATTTTGGGACTTCGCGTATTCTAGGAAATAGTATCGGTGGACTCTA 
65 TGCCTTGGTCTTCTTCITATTAAATACCTTTTTCCACGAAGCCTTTTGGGTG 
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CCATGTTAACCATTATGACAAATGTAGCCATGAATAACAAAGCAGGGGTTATTGGTGGTGTAGCAGCTATGTTAAT 

CATTACCCTATCAATTCCAAGTGGTGAGACAATTTTGTACGTGTTTGTGCGTGTATTAGAAACGTTTATGGGAGTT 

TTTGTCGCAATTATCGTAAATTACGATATTGATCGTATTCGTCTCTTTTTAGAGAAAAAAGAAAAATAA 

4178.2 

ATGAATAAATCAGAACACCGCCACCAACTTATACGCGCTCTTATCACAAAAAACAAGATTCATACACAGGCTGAG 

TTGCAAGCCCTTCTTGCTGAGAACGACATTCAAGTAACCCAGGCAACCCTCTCACGCGACATCAAAAATATGAAC 

CTATCAAAAGTCCGCGAAGAAGATAGCGCTTATTATGTTCTTAACAATGGTTCCATCTCAAAATGGGAAAAACGTC 

TCGAACTCTACATGGAAGACGCCCTTGTCTGGATGCGCCCAGTTCAACACCAAGTCCTACTAAAAACCCTTCCTGG 

ACTGGCTCAATCCTTTGGTTCTATCATTGATACTTTGAGCTT^ 

TCTGTCTTATCATCTGTGAAGATGCAGATACTGCTCAAAAGTGCTTTGAAGAACTGAAAAAATTCGCCCCACCATT 
TTTCTTTGAAGAATAA 

4179.1 

ATGAAAAGTATAAAATTAAATGCTCTATCTTACATGGGAATTCGTGT 
AACCTATGTCGCGCGTGTCTTGGACCGAACTGACTATGGTTACTTCAACTCAGTCGAC^ 

TGCCCTTTGCAACTTATGGTGTCTATAACTACGGTTTAAGGGCTATCAGTAATGTCAAGGATAACAAAAAAGATCT 

TAACAGAACClTl rCTAGTCTT rnTATTTGTGCATCGCTTGTACGATTTTGACCACrGCTGTCTATATCCTAGCCT 

ATCCTCTCTTCTTTACTGATAATCCAATCGTCAAAAAGGTCTACCTTGTTATGGGGATTCAACT 

TTTTCAATCGAATGGGTCAATGAAGCTCTGGAAAATTACAGTTTTCTCITTTA 

GATGCTGGTCTCTATTTTCTTATTTGTTAAAAATGAACACGATATTGTTGTCTATACACTTGTGAT 

CGCTGATTAACTACCTGATTAGTTATTTTTGGATTAAAAGAGACATCAAACTTGTTAAAATTC 

TAAACCACTCTTTCTCCCTCTGACAGCCATGTTAGTCTTTGCC 

TCITCCTCGTTAAAACAGGGATTGATGTCAACGTTAGTTACTATACCATAGCTCAGCGAATTGTGACCGTTATA 

TGGGGTTGTAACAGGTGCAATTGGAGTGAGTGTGCCTCGTCTCAGTTACTATCTGGGGAAAGGAGACAAAGAAGC 

CTATGTTTCTCTGGTTAATAGAGGTAGTCGAATCTTTAACTTCTTTATCATTCCACTG 

TAGGACCAAATGCCATCCTACITTACGGTAGTGAAAAATATATCGGAGGCGGCATCITGACCrCT 

TCGTACGATTATCCTGGCCTTAGATACCATTCTTGGTTCCCAAATTCTC^^ 

ACAGTCTATACAGTCTTTGCTGGGCTACTCAATTTGGGCTTGAATAGTCTCCTTTTTTT 

TGAATACTACTTACTGACAACTATGCTATCAGAGACTTCTCTACTTGTTTTCTATATCATTTTCATCC 

AACTCATCCACTTGGGACATATCTTTAGCTATACTGTTCGATACT^ 

TC CTG ATT A ATTT CGTGT ATCCTGT AG AT ATGGTC ATT AATTTG C C ATTTTTG ATT A 

ATACTGGTTTGATTGTCITGCTATCAGCTATCTCTTATATTAGTCTACr^ 

G AATTTTT A AACC ATGTCCT AG C CIT A AAAAAT AAATTT AAAAAATC AT AG 

4179.2 

ATGAAACAACTAACCGTTGAAGATGCCAAACAAATTGAATTAGAAATTTTGGATTATATTGATACTCT 

AGCACAATATCAACTATATTATTAACTACGGTACTCTGATTGGGGCGGTTCGACATGAGGGCTTTATCCCTTGGGA 

CGACGATATTGATCTGTCCATGCCTAGAGAAGACTACCAACGATTTATTAACATTTTTCAAAAGGAAAAAAGCAA 

GTATAAGCTCCTATCCTTAGAAACTGATAAGAACTACTTTAACAACTTTATCAAGATAA^ 

ATTATTGATACTCGAAATACAAAAACCTATGAGTCTGGTATCTTTATCGATATTTT^ 

TCCTAAGGTCATTGATACTTGTTATAAACTGGAAAGCTTCAAACTGCTGTCTTTCAGTAAACATAAAA 

TATAAGGATAGCCTTTTAAAAGATTGGATACGAACAGCCTTCTGGTTACTCCTTCGACCGG 

TGCAAATAAAATCGAGAAAGAAATTCAAAAATATAGTCGTGAAAATGGGCAATATATGGCTTTTATCCCTTCAAA 
ATTTAAGGAAAAGGAAGTCITCCCAAGTGGTACCTTTGATAAAACAATCGATrTACCCTT^ 
CCTGCACCTGAAAAATTTGATACTATriTGACACAATTTTATGGAGATTATATGACCCTACCA 
GCTTCTACAGTCATGAATTTCACGCTTATAAATTGGAGGATTAG 

4179.3 

ATGATAAAAATCAATCATCTAACCATCACACAAAACAAAGATTTACGAGATCTTGTATCTGACCTAACCATGACC 
ATCCAAGACGGGGAAAAGGTTGCTATTATTGGTGAAGAAGGAAATGGCAAATCAACCITACTTAAAATTrTAATG 
GGGGAAGCTTTGTCTGATTTCACTATCAAGGGAAACATCCAATCTGACTATCAGTCACTGGCCTACATTCCTCAAA 
AAGTCCCTGAGGACCTAAAAAAGAAAACITTACACGACTACTTCriTTTAGATTCT 

CCTCTATCGTTTGGCGGAGGAATTGCATTTTGATAGCAATCGTTTCGCAAGTGACCAAGAGATTGGCAATCTATCA 
GGGGGCGAAGCTTTGAAAATTCAGCTTATCCATGAGTTAGCCAAACCCTTTGAGATTCTA 

CAAATGACCTAGACCTTGAGACAGTTGATTGGCTAAAAGGCCAGATTCAAAAGACCAGGCAAACCGTTATTTTCA 
TTTCCCATGATGAAGACTTTCITTCrGAAACGGCAGACACTATTGTTCACTTGCGAC 

AGCGGAAACGCTAGTAGAGCATTTAGACTATGATAGCTATAGTGAGCAGAGAAAGGCTAATTTTGCCAAACAAAG 

TCAGCAAGCTGCTAACAACCAAAGAGCCTACGATAAAACCATGGAAAAACATCGGAGAGTTAAGCAAAATGTAG 

AAACTGCGCTTCGAGCTACCAAAGATAGTACTGCCGGTCGCCTATTGGCTAAAAAGATGAAAACTGTCCTCTCAC 

AAGAAAAACGCTACGAAAAGGCAGCTCAGTCCATGACTCAAAAGCCACTTGAAGAGGAACAAATCCAACTTTTCT 

TTTCAGACATCCAACCATTACCAGCTrCTAAAGTCTTAGTCCAACTGGAAAAAGAAAATTTGTCCATTGACGACCG 

AGTTTTGGTTCAAAAACTACAACTAACTGTCCGTGGCCAAGAAAAAATCGGTATTATCGG 

GCCAAATGGTGTTGGGAAATCAACTCTGTTAGCCAAGTTACAGAGACTTCTGAATGATAAAAGAGAGATTTCACT 
TGGTTTTATGCCACAAGATTACCACAAAAAACTGCAATTGGATTTATCCCCAATAGCCTATCTCAGTAAAACTGGG 
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GAAAAAGAGGAACTACAGAAAATCCAATCrCACCTAGCTAGTCTCAATTTCAGTTATCCAGAAATGCAGCATCAA 

ATTCGCTCCTTATCTGGCGGACAACAGGGAAAACTCCTGCTTTTGGATTTAGTCCTGCGC 

TGCTGGATGAACCCACACGAAACTTTTCTCCCACTTCTC 

CGGTCTCATCACTGTTTCGCATGACCGTCGTTTCTTAAAAGAAGTCTGCTCGATCATCTATCGCATGACAGAACAC 
5 GGTTTGAAGCTAGTTAATTTAGAAGATTTATAA 

4179.4 

ATGAAACCAAAAACATTTTACAACTTGCTTGCCGAGCAGAATCTTCCACTTTCG 
GAACGTTATTTTGAGCTCTTGGTCGAGTGGAATGAGAAGATTAATTTGACGGCGATTACGGACAAGG 
10 TATCTCAAACATTTTTACGATTCGATTGCACCCATTCTTCAAGGTTTGATTCCCAATGAAACT^ 

TATCGGGGCTGGGGCAGGATTTCCTAGTCTACCAATGAAAATTCTCTATCCGGAGTTAGATGTGACCATTATTGAT 
TCACTCAATAAGCGCATCAACTTCCTACAACTCTTGGCTCAAGAACTGGATTTG 

GACGTGCCGAAGATTTTGCCCAAGACAAGAACTTCCGTGCTCAATATGATTTTGTAACAGCTCGTGCGGTTGCCCG 
TATGCAGGTCCTATCTGAATTGACTATTCCCTACCTTAAGGTTGGTGGCAAACTATTAGCACTCAAGGCTAGCAAT 
15 GCGCCTGAGGAATTATTAGAAGCTAAGAATGCCCTCAATCTCCTTTTTAGTAAGGTCGAAGACAATCT 

CCCTACCGAATAGAGATCCGCGCTATATCACAGTGGTAGAAAAGAAAAAAGAAACACCAAATAAATATCCACGTA 
AGGCTGGTATGCCAAATAAACGCCCACTTTAA 

4179.6 

20 ATGAGTATTAAACTAATTGCCGTTGATATCGACGGAACCCTTGTCAACAGCCAAAAGGAAATCACTCCTGAAGTTT 
TTTCTGCCATCCAAGATGCCAAAGAAGCTGGTGTCAAAGTCGTGATTGCAACTGGCCGCCCTATCGCAGGCGTTGC 
CAAACTTCTAGACGACTTGCAGTTGAGAGACGAGGGGGACTATGTGGT 

AACTGCTACAGGACATGAGATTATCAGCGAATCCirGACTTATGAGGATTATCTAGATATGGAATTCCTCAGT 
AAGCTCGGTGTCCACATGCATGCCATTACCAAGGACGGTATCTATACTGCAAATCGCAATATCGGAAAATACACT 

25 GTACACGAATCAACCCTCGTCAGCATGCCTATCTTCTACCGTACCCCTGAAGAAATGGCTGGCAAAGAAATTGTTA 
AATGTATGTTTATCGATGAACCAGAAATTCTCGATGCTGCGATTGAAAAAATTCCAGCAGAATTTTACGAGCGCT 
CTCCATCAACAAATCTGCTCCTTTCTACCTCGAACTCCTTAAAAAGAATGTAGACAAGGGTTCAGC 
TTGGCTGAAAAACTCGGATTGACCAAAGATGAAACCATGGCAATCGGTGATGAAGAAAATGACCGTGCCATGCTG 
GAAGTCGTTGGAAACCCCGTTGTCATGGAAAATGGAAATCCAGAAATCAAAAAAATCGCCAAATACATCACCAAA 

30 ACAAATGACGAATCCGGCGTTGCCCATGCCATCCGAACATGGGTACTGTAA 

4179.7 

ATGACTTGGATTATTCTTGGAGTTATCGCTCTTATTGTTATTTTTGTGATTGTTAGCT 

TCGTATGCAAACCAAGGAGGCTTGGAGTCAGATTGATGTTCAGTTGAAACGTCGCAATGACCTCTTGCCAAACTTG 
35 ATTGAGACTGTAAAAGGTTATGCCAAATATGAAGGTTCTACCCTTGAAAAGGTGGCAGAACTACGTAACCAAGTG 
GCGGCAGCGACTTCACCAGCAGAAGCTATGAAAGCCAGTGATGCCCTCACTCGTCAGGTTTCAGGTATTTTTGCAG 
TTGCAGAAAGCTATCCAGATTTGAAAGCTAGTGCTAACTTTGTTAAATTC 
ATAAAATTTCTTACTCrCGTCAACTCTATAACAGTGTTGTCAGCAACTACAATGTAAAA 

CAATATTATCGCTGGAATGTTTGGATTTAAAGCGGCAGATTTCCTTCAAACACCTGAAGAGGAAAAGTCGGTTCCT 
40 AAAGTTGATTTTAGCGGTTTAGGTGACTAA 

4179.8 

ATGTTGTTTGATCAAATTGCAAGCAATAAACGAAAAACCTGGATTTTGTTC 

TGTTGGTTATGCGGTTGGTTATCTCTTTATAAGATCTGGACTTGGTGGTTTGGTTATTGCACTGATTATCGGCT^ 
45 TCTACGCTTTGTCTATGATTTTTCAATCGACAGAGATTGTCATGTCCATGAATGGAGCGCGTGAGGTGGATGAGCA 
AACGGCACCAGACCrCTACCATGTAGTGGAAGATATGGCTCTGGTCGCTCAGATTCCTATGCCCCGTGTTTTCATC 
ATTGATGATCCAGCCTTAAATGCCITTGCGACAGGTTCTAATCCTCAAAATGCGGCTGTTGCTGCGACTTCAG 
TACTAGCTATCATGAATCGTGAAGAACTAGAAGCTGTTATGGGACATGAAGTCAGTCATATTCGTAATTATGATAT 
CCGTATTTCGACTATTGCAGTTGCCCTTGCTAGTGCTATCACCATGCTTTCTAGTATGGCAGGTCGTATGATGTGGT 
50 GGGGTGGAGCAGGTCGCAGACGAAGTGATGATGACCGAGATGGAAATGGTCTTGAAATCATTATGCTAGTGGTTT 
CCCrACTAGCTATTGTACTGGCACCTCTCGCTGCAACCTTGGTTCAGCTCGCTATTTCTCGTCAGAGGGAATTTCTG 
GCAGATGCATCTAGTGTCGAGCTGACTCGCAATCCCCAGGGAATGATTAATGCCCTAGATAAGTTGGACAATAGC 
AAACCTATGAGTCGCCACGTCGATGATGCTAGCAGTGCCCnTATATCAATGATCCTAAGAAAGGTGGGGGGTTC 
CAAAAACTCTTTTATACCCACCCACCTATCTCAGAACGGATTGAACGTTTAAAACAGATGTAA 

55 

4179.9 

ATGAAATTAAATATTCAAGAAATTCGTAAGCAGTCTGAAGGTTTGAACTTTGAACAAACGTTAGATTTAGTTGATG 
ACCTGCGTGCACGTAATCAAGAAATTTTAGATGTAAAAGATATCCTTGCAGTTGGGAAAGTACAATATGAAGACC 
GTATGTATTTCTTAGATTATCAACTATCTTATACCATTGTTCTTGCTTCGAGTCGCAGTATGGAGCCAGTTGAGTTA 
60 GTTGAATCTTATCCAGTCACGGAAGTTTTCATGGAAGGCGCAACTAACCAGCTAGATCAAGAAGTTTTAGATGATG 
ACTTGGTCTTGCCCATCGAAAATGGGGAGCTTGACCTTGCTGAGAGTGTATCAGACAATATCCTGCTAAACATTCC 
TATCAAGGTCTTGACGGCTGAAGAAGAAGCTGGTCAAGGATTTATCTCAGGAAATGACTGGCAAATCATGACAGA 
GGAAGAATACCAAGCTCAAAAAGCAGTAAAGAAAGAAGAAAACAGTCCTTTTGCTGGCITACAAGGACTATTTGA 
CGGAGATGAATAA 

65 
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4179.12 

ATGGAGTTATTTATGAAAATCACAAACTATGAAATCTATAAGTTAAAAAAATCAGGTTTGACCAATCAACAGATTT 
TGAAAGTGCTAGAATACGGTGAAAATGTTGATCAGGAGCTTTTGTTGGGTGATATTGCAGATATCT 
TAATCCAGCCGTTTTTATGGAACGTTATTTTCAGATAGACGATGCGCA 
5 TCTTTCTCTATTTTAGATGACTGTTATCCTTGGGATTTGAG^^ 

GGGAAATCTTGACCTCCTGAAATTCCCGAAGGTAGCGGTCGTGGGCAGTCGTGCTTGTAGCAAACAGGGAGCTAA 
GTCAGTTGAAAAAGTCATTCAAGGCTTGGAAAATGAACTGGTTATTGTCAGTGGTCTGGCCAAGGGCATTGACAC 
AGCAGCTCATATGGCAGCTCTTCAGAATGGCGGAAAAACCATTGCAGTGATTGGAACAGGACTGGATGTGTTTTA 
TCCTAAAGCCAATAAACGCTTGCAAGACTACATCGGCAATGACCATCTGGTTCTAAGTGAATATGGACCTGGTGA 
10 ACAACCTCTGAAATTTCATTTTCCTGCCCGTAATCGCATCATTGCTGGACTTTGTCGTGGTGTGATTGTAGCAGAG 
GCTAAGATGCGTTCAGGTAGTCTCATTACGTGTGAGCGAGCAATGGAAGAAGGACGCGATGTCTTTGCTATTCCTG 
GTAGCATTTTAGATGGACTATCAGACGGTTGCCATCATTTGATTCAAGAAGGAGCAAAATTGGTCACCAGTGGGC 
AAGATGTTCTTGCGGAATTTGAATTTTAA 

15 4181.1 

ATGAAACGTCAATTAGCCTTGGTCGTCTTTAGTGGTGGTCAAGATTCAACAACCT^ 

ACrATGAAACAGTCGAAGCTGTCACCTTTGCCTACGGCCAACGTCATCACCTCGAAATTCAAATTACTAGAGA^T 
CGCTAAGGAACAGGGCATTCGTCACCATATCCTCGATATGTCTCTGCTGGGACAAATCACTGCTCAGCCAGACTTT 
GCGACGATTCATATTTCCTACATTCCTGACAAGCTCTGTGTCGAGTCAAAATCCCTCAAACTATATCTATTTAGCT 
20 ACCGAAACCACGGAGATTTCCACGAAAACTGTATCAACACCATCGGGAAAGACTTGGTCAACTTGCTAGACCCTC 
GCTATTTAGAAGTCTGGGGAAAATTCACTCCGCGCGGTGGCATTTCAATCGACCCCTACTACAACTACGGTAAGCA 
AGGAACTAAGTATGAGGGCTTGGCAGAACAACGCCTCTTCCAACACGACCTTTATCCAGAGAAAATTGACAACCG 
CTAA 

25 4181.2 

ATGACCGAAACGGTAGAAGATAAAGTAAGTCATTCAATTACTGGGCTTGATATCCTCAAGGGGATAGTTGCTGCG 
GGAGCTGTCATAAGTGGAACCGTTGCAACTCAAACGAAGGTATTTACAAATGAGTCAGCAGTACITGAAAAAACT 
GTAGAGAAAACGGATGCrrTGGCAACAAATGATACAGTAGTTCTAGGTACGATATCTACAAGTAATTCAGCGAGT 
TCAACTAGTTTGTCAGCITCAGAGTCGGCAAGTACATCTGCATCTGAGTCAGCCTCAACCAGCGCTTCGACCTCAG 

30 CAAGTACAAGTGCATCAGAATCAGCAAGTACATCGGCTTCGACAAGTATTTCTGCATCATCTACTGTGGTAGGTTC 
ACAAACAGCTGCCGCTACAGAAGCAACTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACCAGCTAGTGATTATGT 
AGCATCAGTTACAAATGTCAATCTCCAATCITATGCTAAGCGACGCAAGCGTTCAGTGGATTCCATCGAGCAATTG 
CTGGCTTCTATAAAAAATGCTGCTGTTTTTTCTGGCAATACGATTGTAAATGGCGCCCCTGCAATTAATGCAAGTC 
TAAACATTGCTAAAAGTGAGACAAAAGTTTATACAGGTGAAGGTGTAGATTCGGTATATCGTGTTCCAATTTACTA 

35 TAAATTGAAAGTGACAAATGATGGTTCAAAATTGACCTTTACCTATACGGTTACGTATGTGAATCCTAAAACAAAT 
GATCTTGGTAATATATCAAGTATGCGTCCTGGATATTCTATCTATAATTCAGGTACTTCAACACAAACAATGTTAA 
CCCITGGCAGTGATCTTGGTAAACCTTCAGGTGTAAAGAACTACATTACTGACAAAAATGGTAGACAGGTTCTATC 
CTATAATACATCTACAATGACGACGCAGGGTAGTGGGTATACTTGGGGAAATGGTGCCCAAATGAATGGTTTCTTT 
GCTAAGAAAGGATATGGATTAACATCATCTTGGACTGTACCAATTACTGGAACGGA 

40 TACATCCTTTACATTTACCCCTTACGCTGCTAGAACAGATAGAATTGGAATTAACTACTTCAATGGTGGAGGAAAG 
GTAGTTGAATCTAGCACGACCAGTCAGTCACTTTCACAGTCTAAGTCACTCTCAGTAAGTGCTAGTCAAAGCGCCT 
CAGCTTCAGCATCAACAAGTGCGTCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAG 
CTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCCTCGGCTTC 
AGCAAGCACATCAGCATCTGAATCAGCGTCAACCAGTGCTTCGGCTTCAGCAAGTACCAGTGCTTCAGCTTCAGC 

45 ATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCGGCCTCAGCAAG 
CACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCAGCCTCAGCATCAACGAGTGCTTCGGCTTCAGCAAGCAC 
AAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCCTCAGCATCAACAAG 
TGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCAGCATCAACGAGTAC 
GTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTC 

50 AGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCGTCGACAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGC 
ATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCA 
ACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACA 
AGTGCCTCGGCTTCAGCAAGCACATCAGCATCTGAATCAGCGTCGACAAGCGCCTCAGCTTCAGCAAGTACCAGT 
GCGTCAGCTTCAGCATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCC 

55 TCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCAGCCTCAGCATCAACGAGTGCTTCG 
GCTTCAGCAAGCACAAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCC 
TCAGCATCAACAAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCA 
GCATCAACGAGTACGTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCA 
TCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCTCGACAAGTGCGTCGGCCTCAACCAGTG 

60 CATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCAT 
CGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGT 
CTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCcTCGGCTTCAGCAAGCACATCAGCATCTGAATCA 
GCGTCGACAAGCGCcTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGACAAGTGCGTCAGCCTCAGCA 
AGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCATCGGCTTCGGCGTCAACCAGTGCATCAGAGTCAGCAAGT 

65 ACCAGTGCGTCAGCTTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGCACCAGTGCGTCGGCTTCAGCAAGTACT 
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AGCGCCTCAGCCTCAGCCTCAACCAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGT 
GCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCTTCAGCGTCAACGAGTGCG 
TCTGAATCGGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCG 
GCTTCAGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCAGCGTCAACAAGTGCATCGGCTT 
5 CAGCGTCAACGAGTGCGTCTGAGTCAGCATCAACGAGTGCGTCAGCCTCAGCAAGCACATCAGCTTCTGAATCTG 
CATCAACCAGTGGGTCAGCCTCAGCATCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGT 
CGACAAGTGCGTCGGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGA 
CAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCAT 
CAGCTTCAGCATCAACGAGTGCATCGGCTTCAGCATCAACCAGTGCATCAGAGTCAGCAAGTACCAGTGCGTCAG 
10 TTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGTACTAG 



ATGGGGGTCGAAACTTGGTTTTATTCTAGCATCTGCTC^ 
ACATGACTGCTGCTAATGGCGGTGGAGGCTTTTTACTAATCITTCT 
15 CTGCTGGCTGAGTTTGCCCTTGGCCGTAGTGCTGGCGTTTCCGCT^ 

ACAAGTACAACnnrTATCGGTTGGATTGGCGCCTTTGCCCTCTTTATCCTCTTATCTTT^ 

TGGATTCTAGTCTATCTAGGTATTGAGTTTGGGAAATTGTTCCAACTTGGTGGAACGGGTGATTATGCTCAGTTAT 
TTACTTCAATCATTTCAAATCCAGCCATTGCCCTAGGAGCTCAAGCGGCCTTTATCCTATTGAATATCTTCATTGTA 
TCACGTGGGGTTCAAAAAGGGATTGAAAGAGCTTCGAAAGTCATGATGCCCCTGCTC^ 
20 TCATCGGTCGCTCTCTCAGTTTGCCAAATGCCATGGAAGGGGTTCTTT^ 

GACTAGCACTGGTCTCCTCTATGCTCTGGGACAATCTTTCTTTGCCCTCTCACTAGGGGTTACAGTCATGTTGACCT 
ATGCITCTTACTTAGACAAGAAAACCAATCTAGTCCAGTCAGGAATCTCCATCGTAGCCATGAATATCTCGATATC 
CATCATGGCAGGTCTAGCCATTTTCCAAGCTCGATCCCCCTTCAATATCCAGTCTGAAGGGGGACCCAGCCTGCTC 

TTTATCGTCTTGCCTCAACTCTTTGACAAGATGCCTTTTGGAACCATTTTCTAC 
25 TTTGCGACAGTCACTTTTTCTGTCGTGATGCTGGAAATCAATGTAGACAATATCACCAACCAGGATAACAGCAAAC 
GTGCCAAATGGAGTGTTATTTTAGGAATTTTGACCTTTGTCTTTGGCATTCCTTCAGCCCT 
GCGGATGTTCACATTTTTGGTAAGACCTTCTTTGACGCTATGGACTTCTTGGTTTCCAATCT 
AGCTCTCTACCTTTCACTTTTTACAGGCTATATCTTTAAAAAGGCTCTTGCAATGGAGGAACT 

AGAGCATGGAAACAAGGACTGTTCCAAGTCTGGCTCTTCCTTCTTCGTTTCTTCGTTTCGTCATTCCAATCATCATC 
30 ATTGTGGTCTTCATTGCCCAATTTATGTAATCAAAAAGGACTTGAGTAG 

4183.5 

ATGTTGAAAAAATGGCAGTTAAAAGATGTTATCITGCTTGCTTTCTTGTC 

TTCAGGATATGTGTATAATATTCTCAGTCTACTCTTAACACCTCTTGGTTTGCAGGCCTTTGCCAATGAAATCCTCT 
35 TCGGTCTCTGGTGTATGGCTGCGCCCATTGCTGCCATCTTTGTTCCGAGAGTCGGAAGTGCAACGATTGGAGAAGT 
GCTAGCTGCGCTTGCTGAAGTCCTTTATGGTAGCCAATTTGGTCTAGGAGCTCTTTTGTCTGGCTT^ 
TGGGAAGTGAATTTGGTTTTATCGTAACTAAGAATCGCTATGAAAGTTGGCTCTCTCTAACTGCT 
GATTACGCTTGTTAGCTTTGTCTATGAATACATTAAGTTAGGTTAC 

GCTTGTGGTACGTTTTATTTCTGTTTATTTCTTCTGTACCATCTTGGTTCGTGCCATTGTCAAACT 
40 TGCAACTGGAGGAAAAGCATAG 



ATGGTCAAAGTAGCAACCCAGACACCGATTATCAGTCTCTTCTTGCTGATTTTATCCTTGGAAACATCTTTCATTCC 
TTCGATTGCTCTGACTCTTTCGGTAGTCGCATTTTGTATTCTCTTTATGCTCTATTACCGTCGATTTA 
45 CTTGGATGATCATACTTGCCATTTTACCATCTTTTGCCAACTACTGGGCAGTTCAGTTACA 

GGCAGTCATGCnTGGAACGAGGGCCTTTGTGACAGTTTGTATCGGCCTTGTCTTTGTTTCCTCTGTTTCACTAAAAG 

AGCTTCTCTTGTACTTGGCTCAAAAGGGGCTATCACGCTCTTGGTCCT 

CCTCTCATTCAGCAAGAAATCAAGTCCCTCAAAGAAGCTTGCCTATTACGTGGTCAAGAACTACATTTTTGGTCGC 
CCrTTGATTTACAGTAAGGTTCTGATGACAGTCTTTAGGTGGCGCCATCTTTACCTGAGAGCTCTATCTGCTCACGG 
50 ATATGACGAACATGCACAGTTGAAGAATAGCTATCGGACTTTTTATATTCCTAAAAAAACAAAATTAATCTACCTG 
CTTTTCTTTTTATTGCTTCAAACCAGTCTATTTTTATAA 

4183.7 

ATGAGAAAGCACCAATTACAAGTTCACAAATTAACCATTTTATCTATGATGATTGCCCTTGATGTAGTCCTTACAC 
55 CTATCTTTCGAATTGAGGGAATGGCACCGATGTCCAGTGTAGTCAATATTCTAGCAGGAATCATGATGGGACCTGT 
TTATGCCTTGGCTATGGCTACAGTCACAGCCTTTATCCGTATGACGACTCAAGGGATTCCGCCTTTAGCTCTCACA 
GGAGCGACTTTTGGAGCCCTTCTAGCAGGTCTCTTTTATAAGTACGGTCGAAAATTTCACTATTCTGCT 
AGATTTTGGGAACAGGTATTATTGGTTCCATTGTTTCCTATCCTGTTATGGTACTCTTTACAGGATCAGCTGCTAAG 
CTTAGCTGGTTTATCTACACGCCTCGATTTTTCGGAGCAACCTITGATTGGTACAGCGATTTCC 
60 ATTTTTAATCAAGCAGGAATTCTTTAAAAAAGTGCAGGGATATTTCTTTAGTGAAAGGATAGACTGA 

4183.8 

ATGCAGGAATTTACAAATCCCTTTCCTATAGGCTCTAGTTCCCTCATTCACTGCATTACCAATGAGATTTCTTGTGA 
GATGCTGGCAAATGGGATTTTGGCTCTGGGATGCAAACCTGTCATGGCAGATGATTCCCGTGAAGTTCTTGATTTT 
65 ACTAAGCAAAGTCAGGCTCTCTTCATCAATTTGGGGCATTTGTCAGCTGAGAAGGAAAAAGCAATCCGCATGGCA 
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GCTTCGTATGCAAACCAATCTTCTCTCCCGATGGTAGTAGATGCGGTTGGCGTAACGACTTCATCCATTCGTAAGA 
GCnTAGTTAAAGACCTTTTAGACTATAGACCTACGGTCCTTAAAGGAAACATGTCAGAAATTCGAAGTCTTGTTGG 
ATTAAAGCACCACGGCGTTGGGGTCGATGCGAGTGCTAAAGATCAAGAAACGGAGGATTTGCTTCAAGTCTTGAA 
AGACTGGTGTCAGACCTATCCTGGTATGTCTTTCTTAGTCACAGGTCCCAAGGACCTCGTCGTTTCGAAAAATCAG 
GTCGCTGTACTGGGAAATGGCTGTACTGAATTAGACTGGATAACAGGGACAGGAGACTTGGTTGGAGCCTTAACA 
GCTGTTTTTCTCAGCCAAGGAAAGACTGGTTTTGAAGCTTCTTGCTTAGCAGTCTCT^ 

GAAAATAGTTGTTCAAGGAATGGGATTGGAAGAATTTCGTTACCAAGTACTCAATCAGCTTTCGCTCCTAAGAAG 
AGATGAAAATTGGCTAGATACCATCAAAGGAGAGGTTTATGAATAG 



4185.3 

ATGAACCATAAAATCGCAATTTTATCAGATGTTCATGGCAATGCGACGGCGCTAGAAGCAGTGATTGCAGATGCT 

AAAAATCAAGGGGCCAGTGAATATTGGCTTCTGGGAGATATTTTTCTTCCTGGTCCAGGCGCAAATGACTTAGTC^ 

CCCTGCTAAAGGACCTTCCTATCACAGCAAGTGTTCGAGGCAATTGGGATGATCGTGTCCTTGAGGCTTTAGATGG 

GCAATATGGCTTAGAAGACCCACAGGAAGTTCAGCTCTTGCGTATGACACAGTATTTGATGGAGCGAATGGATCC 

TGCAACGATTGTCTGGCTACGAAGCTTGCCTTTGCTGGAAAAGAAAGAAATTGACGGA 

CATAATTTACCTGACAAAAACTATGGTGGTGACTTGCTAGTTGAGAATGATACAGAGAAATTTGACCAACTGCT 

GATGCGGAAACGGACGTGGCAGTTTATGGTCATGTTCACAAGCAGTTGCTTCGTTATGGAAGTCAAGGGCAACAA 

ATCATCAATCCAGGGTCGATTGGCATGCCCTATTTTAATTGGGAGGCGTTAAAAAATCACCGTTCCCAGTATGCCG 

TGATAGAAGTTGAAGATGGGGAATTACTCAATATCCAATTTCGTAAAGTTGCTTATGATTACGAAGCTGAGTTAGA 

ATTGGCCAAGTCCAAGGGGCTTCCCTTTATCGAAATGTATGAAGAACTGCGTCGTGACGATAACTATCAGGGGCA 

CAATCTGGAATTATTAGCCAGCTTAATAGAAAAGCATGGGTATGTAGAGGATGTGAAGAATT^^ 

TAA 

4186.1 

ATG A ATGT AA ATC AG ATTGT ACG G ATT ATTC CT A CTTT AAA AG CT AAT AAT AG AAAATT AA ATG AAA C ATTTT AT A 

TTGAAACCCTTGGAATGAAGGCCTTGTTAGAAGAATCGGCCTTTCTGTCACTAGGTGACCAAACGGGTCT^ 

GCTGGTITTAGAAGAAGCTCCCAGTATGCGTACTCGTAAGGTAGAGGGAAGAAAAAAACTAGCTAGATTGATTGT 

CAAGGTGGAAAATCCCTTAGAAATTGAAGGAATCTTATCTAAAACAGATTCGATTCATCGATTATATAAAGGTCA 

AAATGGCTACGCTTTTGAAATTTTCTCACCAGAAGATGATT^^ 

CTAGTAGAAGTAGGAGAAAAGCCrGAATTTCAAACAGATTTGGCATCAATTTCIT^ 
TGGAATTACATCTCCCAACTGATATCGAAAGTTTCTTGGAATCATCT^ 

AGCTCAGGGGCAGGATTTGACTGTGGACAATACGGTTACCTGGGACTTATCTATGCTCAAGTTCTTGGTCAATGAA 
TTAGACATAGCAAGTCTTCGCCAGAAGTTTGAGTCTACTGAAT ATTTT ATTCCTAAGTCTGAAAAATTCTTCCTTG 
GT AA AG AT AG AAAT AATGTTG AATTGTG GTTTG A AG A AGT ATG A 

4186.2 

ATGAAGTGGACCAAGATTATTAAAAAAATAGAAGAACAAATCGAGGCAGGGATTTATCCCGGAGCCTCTTTTGCG 

TATTTTAAGGACAATCAATGGACAGAGTTCTATTTAGGCCAGAGTGACCCAGAGCATGGCTTGCAGACTGAGGCA 

GGACTAGTTTATGACCTAGCTAGTGTCAGCAAGGTTGTTGGGGTTGGCACAGTTTGTACCTTCrrTGTGGGAAATAG 

GTCAATTAGATATTGATAGACTGGTAATAGATTTTTTACCTGAGAGTGATTATCCAGACATCACTATTCGCCAGCT 

CTTGACTCATGCAACAGACCnTGATCCTTTTATTCCTAATCGTGATCTTTTAACAGCCCCTGAATTAAAGGAAGCG 

ATGTTTCATCTCAACAGACGAAGTCAGCCAGCCTTTCTTTATTCGGATGTCCATTTTTT 

GGAAAGAATTTTTAATCAAGATTTGGATGTGATTTTAAAGGATCAAGTCTGGAAACCITGGGGAATG 

TAAGTTTGGGCCAGTTGAGCTTGCTGTTCCAACAGTTAGAGGTGTAGAGGCAGGCATAGTGCATGATCCCAAGGC 

TCGTCTCCTGGGTAGACATGCTGGGAGTGCTGGTTTATTTTCGACTAT^ 

TATTTAGCAGATGATTTTGCAAGAGACHTAAATCAAAATTTTTCT^ 

GGAATTTGGAAGGAGATTGGCTAGACCATACGGGCTATACAGGTACCTTTATCATGTGGAATCGTCAGAAGCAAG 
AAGCCACTATTTTCCTATCGAATCGTACCTATGAAAAGGACGAGAGAGCTCAATGGATATTAGACCGCAATCAAG 
TGATGAACTTGATTCGCAAAGAAGAGTAA 

4187.2 

ATGATGAAGAAGACTTATAATCATATTTTGGTCTGGGGAGTCATTTTCTATAGC 

TACTCCTCAAGAACAATCTACCGTGGGAGTGGGAACTCCAGGTATTCAGCATCTTGGACGCCTGGTTTTTCTTTTG 
ACTCCTTTCAATTCTCTCTGGAAACTGGGCGAAGTGAGTGACATTGGACAATTATGTTGGATTTT^ 
TCCTCAATGTCTTCTTGTTTTTTCCTCTGATTTTCC 
GTCCTTCTTTTTAGTTTTCTTGTGAGTCTTGGAATCGAGTGTAOT 

CGCGTCTTTGAGATTGATGATTTGTGGACCAACACTTTGGGTGGCTATCTGGCTTGGCTCCTTTATAAACGATTAC 
ATAAAAACAAGGTAAGGAATTAA 
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4188.1 

ATGAAGATTCCTCTCTTAACITTTGCAAGGCATAAATTTGTTTATGTCIT 

TATCGTGATGTTTTGATGACTTATTTCTTTTTTGATATTCATGCGCCCGATCTAGCTAAATTCGATGGACAAGCA 

TAAAAATGACTTATTAAAATCAGCATTAGATTTTCGTATT^ 
5 TTCCAATCATCATTGTTTTGCTAGGTTTTCAATATATTGAGCTGAAAAATAAAGTTTTACGATTGAGTATTGGAAG 

AGAAGTGAGTTATCAAGGGTTAAAAAGAAAGTTGACITTGCAAGTTG 
ACTGTGCTGATAATTGCAATTATAACCTATTTCTTTGGGACTTTT^ 

GGAAGTGGTTTACAAAGACTCCTAGATGGAGAGATAAAAAGCTATTTGTTCTTTACTTGTGTCCTACTAATCGGTA 
TTTTCATCAATGCAATCTATTTTTTACAAATAGTT^ 
10 ATGTTTCTTTGGCTTGGTTCTATGCTGCT^ 

AGCTAGCTATGGGGATGTAAGTTTGATGAAACTCTTTACTCCTTATATCCTTTA 
AAAAATATGAAGATAATGTTTAA 



15 ATGAAGATAATGTTTAAGAATTTTAACAATATTTTGCTAAATAGAAAGATTGTTTTACTA 
GATGATTTTGATAAACCATCTATTGTCAACAGCGGTTCAA 
ATTTCAATTTTTTCCTATAATGACTATTCTGAAGCGAATTTAGAAATCCCCA^ 

CATGGTAGGATGGCTCTCTGTCATTTTACTTGAAAGTGATTTGGCAGACCATTACCATCACTTGATTCGCTATCAA 
TCAAGCTCCTTTTTCGATTATACAAGGAAACGATTGGTTGTCATTTCTAAATTT^ 
20 GTTTCTTGGTTTACTTCCTCTAGGAATTCATTTCAAAACAGTCGCACTTTTCTTTTT 
GTACTTACTACTGTCTTATCTGATAGCACTGATTAGTGCGGGCGCTC 

TGTGGGACAAGAATGGATGATGGATCATATTGTAACAGTGTATTTAGTACTCTTAAGTTTATTAGTTATGTTGATT 
GTTAGTCGCTTGGAAGAGAAATTTAAGAAAGGATAA 

25 4188.5 

ATGGGCAAAGGAGAGATGGGCAAAGGAGTTATTGGCTTGGAGTTCGACTCAGAAGTATTGGTCAACAAGGCTCCA 
ACCCTTCAATTGGCAAATGGTAAAACAGCGACTTTCCTAACCCAGTATG^ 

ATAAGGAAGATATCGGACAGGAAATTATTGGTATAGCTAAAGGAAGCATCGAAAGTATGCATAATCTTCCTGTAA 
ATCTAGCAGGTGCCAGAGTTCCTGGCGGAGTAAATGGTAGCAAAGCAGCGGTGCATGAAGTTCCAGAATTTACAG 
30 GGGGAGTTAATGGTACAGAGCCAGCTGTTCATGAAATCGCAGAGTATAAGGGATCTGATTCGCTTGTAACTCTTAC 
TACAAAAAAAGATTATACITACAAAGCTCCTCTTGCTCAGCAGGCACTTC 
CCTCCTAGCTTCACTAGGACTAACAGCTTTCTTCCTTGGTCTGTTTACGCTAGGGAAA 

4188.10 

35 ATGTTTAAAGTTTTACAAAAAGTTGGAAAAGCTTTTATGTTACCTATAGCTATACTTCCTGCA 

GGGGATTGGTGGTGCACITTCAAACCCAACCACGATAGCAACTTATCCAATACTAGACAATAGTATTTTTCAATCA 
ATATTCCAAGTAATGAGCTCTGCAGGAGAGGTTGTATTCAGTAATTTGTCACTACTTCTCTGTGTGGGATTATGTA 
TTGGCTTAGCGAAACGAGATAAAGGAACCGCTGCGTTAGCAGGAGTAACTGGTTACTTAGTTATGACTGCAACGA 
TCAAAGCTTTGGTAAAACTTTTTATGGCAGAAGGATCTGCAATTGATACTGGAGTTATTGGAGCATTAGTTGTCGG 

40 AATAGTTGCCGTATATTTGCACAACCGATATAACAATATTCAATTACCTTCCGCTTTAGGATTCTTTGGAGGTTCA 
CGCTTCGTTCCTATTGTTACATCGTTCTCTTCTATCTTGATTGGCTTTGTCTTCTTTGTTATT^ 
CAACTTCTTGTTTCTACAGGTGGATATATTTCTCAGGCGGGTCCAATTGGAACT^ 

ACTTTCTGGAGCAGTAGGCTTACATCATATAATTTACCCTATGTTTTGGTATACTGAACTTGGTGGTGTTGAAACT 
TTGCAGGACAAACAGTGGTTGGAGCTCAAAAAATATTTTTTGCTCAATTAGCCGATTTGGCCCATTCTGGATTATT 

45 TACAGAAGGAACAAGGTTTTTTGCAGGTCGTTTCTCAACAATC 

TACCATAGTGTTCCTAAAAATCGTCGTAAAAAATACGCGGGTTTGTTTTTTGGAGTTGCTTTAACATCTT^ 
CGGTATTACAGAACCAATTGAATTTATGTTTCTATTCGTCAGTCCGGTTCTATATGTTGTTCACGCATTCCTTGA 
GTGTTAGCTTCTTTATTGCAGACGTCTTAAATATTTCAATAGGAAACACATTTTCAGGAGGTGTA^ 
TTATTTGGAATTTTGCAGGGGAACGCTAAGACGAATTGGGTTCTTCAGATTCCATTTGGACTTATTTGG 

50 GTATTATATTATTTTTAGATGGTTCATTACTCAATTCAACGTTCTAACGCCAGGGCGAGGAGAAGAAGTAGATTCT 
AAAGAAATTTCTGA^TCCGCAGATTCAACTTCAAATACTGCAGATTATTTAAAACAGGATAGCCTACAAATTATCA 
GAGCCTTGGGTGGATCAAATAATATAGAAGATGTAGATGCTTGTGTGACACGTTTACGTGTAGCTGTAAAAGAAG 
TTAATCAAGTTGATAAAGCACTTTTAAAACAAATTGGTGCAGTTGATGTCTTAGAAGTGAAGGGTGGCATTCAAGC 
AATCTATGGAGCAAAAGCAATCTTATATAAAAATAGTATTAATGAAATTTTAGGTGTAGATGATTAA 

55 

4188.11 

ATGAAATTTAGAAAATTAGCTTGTACAGTACTTGCGGGTGCTGCGG 

GCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTGGTGGGCATTCCCAGTAT 
TTACCCAAGAAAAAACTGGTGACGGTGTTGGAACTTATGAAAAATCAATCATCGAAGCGTTTGAAAAAGCAAACC 

60 CAGATATAAAAGTGAAATTGGAAACCATCGACTTCAAGTCAGGTCCTGAAAAAATCACAACAGCCATCGAAGCAG 
GAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTATCATCCAATACGGTAAAAACGGTAAATTGGCTGAGT 
TGAATGACCTCTTCACAGATGAATTTGTTAAAGATGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAG 
ACAAGGCTTATATGTATCCGATTAGTTCTGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGG 
AGTAGCAAACCTTGTAAAAGAAGGTTGGACAACTGATGATTTTGAAAAAGTATTGAAAGCACTTAAAGACAAGGG 

65 TTACACACCAGGTTCATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACACGTGCCTTTATCTCTAACCTTTAT 
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AGCGGTTCTGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGAAAAA 
GCAACTAGCTGGATTAAAGACAATTTGATCAATAATGGTTCACAA 

GCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCAAGCTAAACTTTTAGAAGCA 
AGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCTCTTGAGTACCTTGTAAACGGG 
TTTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCTGCATCTAAGAAATTCATCCAGTTTATCGCAGATGAC 
AAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCTTTCCCAGTC^ 

ACAAACGCATGGAAACAATCAGCGGCTGGACTCAATACTACTCACCATACTACAACACTATTGATGGATTTGCTG 
AAATGAGAACACTTTGGTTCCCAATGTTGCAATCTGTATCAAATGGTGACGAAAAACCAGCAGATGCTTTGAAAG 
CCTTCACTGAAAAAGCGAACGAAACAATCAAAAAAGCTATGAAACAATAG 

4188.12 

ATGCAATCTACAGAAAAAAAACCATTAACAGCCTTTACTGTT^ 

TGTTCATCTTTCCATTCTACTGGATTTTGACAGGGGCATTCAAATCACAACCTGATACAATTGTTATTCCTCCTCAG 
TGGTTCCCTAAAATGCCAACCATGGAAAACTTCCAACAACTCATGGTGCAGAACCCTGCCTTGCAATGGATGTGG 
AACTCAGTATTTATCTCATTGGTAACCATGTTCTTAGTTTGTGCAACCTCATCTCTAGCAGGTTATGTATTGGCT 
AAAACGTTTCTATGGTCAACGCATTCTATTTGCTATCTTTATCGCTGCTATGGCGCT^ 

TACCATTGGTACGTATCGTCAACTTCATGGGAATCCATGATACTCTCTGGGCAGTTATCITGCCTTTGATTGGATG 

GCCATTCGGTGTCITCCTCATGAAACAGTTCAGTGAAAATATCCCTACAGAGTTGCTTGAATCAGCTAAAATC 

GGTTGTGGTGAGATTCGTACCTTCTGGAGTGTAGCCTTCCra 

TTACCITCATCAATACrrGGAATGACTACTTCATGCAATTGGTAATGTTGACTTCACGTAACA 

CTTGGGGTTGCGACCATGCAGGCTGAAATGGCAACCAACTATGGTTTGATTATGGCAGGAGCTGCCCTTGCTGCTG 
TTCCAATCGTCACAGTCTTCCTAGTCTTCCAAAAATCCTTCACACAGGGTATTACTATGGGAGCGGTCAAAGGATA 
A 

4191.1 

ATGAAAAAAACTTTTTTCTTACTGGTGTTAGGCTTGTTTTGCCTTCTTCCACT 

ATAAACTCTTATCAAGGGGATTTGTATATTCATGCAGACAATACGGCAGAGTTTAGACAGAAGATAGTTTACCAGT 
TTGAGGAGGACTTTAAGGGCCAAATCGTGGGACTTGGACGTGCT 

CTCATCCAAAGATTCAGGCCGCGAAAAACGGTGCAGAACTAGCAGATGTGACTAGCGAAGTAACAGAAGAAGCG 
GATGGTTATACTGTGAGAGTCTATAATCCAGGTCAGGAGGGCGACATAGTTGAAGTTGACCTCGTCTGGAACTTA 
AAAAATTTACTTTTCCTTTATGATGATATCGCTGAATTAAATTGGCAACCTCTGACAGATAG 
AAAAGTTTGAATTTCATGTAAGGGGAGACAAGGGGGCTGAAAAACTCT^^ 

AGGGAACGATTGAAAAGAGTAACCTTGATTATACTATCCGTTTAGACAATCTTCCGGCTAAGCGTGGAGT^ 
GCATGCCTATTGGCCTCGGACCGATTTTGCTAGCGCTAGGGATCAGGGATTGAAAGGGAATCGTTTAGAAGAGTTT 
AATAAGATAGAAGACTCGATTGTTAGAGAAAAAGATCAGAGTAAACAACTCGTTACTTGGGTCCTCCCTTCGATC 
CTTTCCATCTCCHTGTTATTGAGTGTCTGCTTCT^ 

CAAAAATCATCGTCTCTATGAACCACCAATGGAATTAGAGCCTATGGTTTTATCAGAAGCAGTCTACTCGACCTCC 
TTGGAGGAAGTGAGTCCCTTGGTCAAGGGAGCTGGAAAATTCACCTTTGATCAACTTATTCAAGCTACCTT 
ATGTGATAGACCGTGGGAATGTCTCTATCATTTCAGAAGGAGATGCAGTTGGTTTGAGGCTAGTAAAAGAAGATG 
GTTTGTCAAGCTTTGAGAAAGACTGCCTAAATCTAGCTTTTTCAGGTAAAAAAGAAG 

TGCGGATTACAAGGTATCTGATAGTCTTTATCGTAGAGCCAAAGTTTCTGATGAAAAACGGATTCAAGCAAGAGG 
GCTTCAACTCAAATCHTCTTTTGAAGAGGTATTGAACCAGATC 

GGGCTCCCAGATTATTATCGTCCTTTAACTGGTGGGGAAAAGGCCTTGCAAGTGGGTATGGGTGCCTTGACTATCC 

TGCCCCTATTTATCGGATTTGGTTTGTTCTTGTACAGTTTAGACGTTCATGGCTATCTT^ 

CTTGGTTTTCTAGGGTTAGTTTTGTCTGTTTTCTATTATTGGAAGCTTCGACTAGAT 

TGAAGCGGGAGCTGAGGTCTACTATCTCTGGACCAGTTTTGAAAATATGTTGCGTGAGATTGCACGATTGGATCAG 
GCTGAACTGGAAAGTATTGTGGTCTGGAATCGCCTCTTGGTCTATGCGACCTTATTTGGCTATGCGGACAAGGTTA 
GTCATTTGATGAAGGTTCATCAGATTCAAGTGGAAAATCCAGATATCAATCTCTATGTAGCTTATGGCTGGCACAG 
TACGTTTTATCATTCAACAGCACAAATGAGCCATTATGCTAGTGTCGCAAATACAGCAAGCACCTACTCTGTATCT 
TCTGGAAGTGGAAGTTCTGGTGGTGGCTTCTCTGGAGGCGGAGGTGGCGGCAGTATCGGTGCCTTTTAA 

4191.2 

ATGAAAAAAGTAAGAAAGATATTTCAGAAGGCAGTTGCAGGACTGTGCTGTATATCTCAGTTGACAGCTTTTTCTT 

CGATAGTTGCTTTAGCAGAAACGCCTGAAACCAGTCCAGCGATAGGAAAAGTAGTGATTAAGGAGACAGGCGAAG 

GAGGAGCGCTTCTAGGAGATGCCGTCTTTGAGTTGAAAAACAATACGGATGGCACAACTGTTTCGCAAAGGACAG 

AGGCGCAAACAGGAGAAGCGATATTTTCAAACATAAAACCTGGGACATACACCTTGACAGAAGCCCAACCTCCAG 

TTGGTTATAAACCCTCTACTAAACAATGGACTGTTGAAGTTGAGAAGAATGGTCGGACGACTGTCCAAGGTGAAC 

AGGTAGAAAATCGAGAAGAGGCTCTATCTGACCAGTATCCACAAACAGGGACTTATCCAGATGTTCAAACACCTT 

ATCAGATTATTAAGGTAGATGGTTCGGAAAAAAACGGACAGCACAAGGCGTTGAATCCGAATCCATATGAACGTG 

TGATTCCAGAAGGTACACTTTCAAAGAGAATTTATCAAGTGAATAATTTGGATGATAACCAATATGGAATCGAATT 

GACGGTTAGTGGGAAAACAGTGTATGAACAAAAAGATAAGTCTGTGCCGCTGGATGTCGTTATCTTGCTCGATAA 

CTCAAATAGTATGAGTAACATTCGAAACAAGAATGCTCGACGTGCGGAAAGAGCTGGTGAGGCGACACGTTCTCT 

TATTGATAAAATTACATCTGATTCAGAAAATAGGGTAGCGCTTGTGACTTATGCTTCCACTATCTTTGATGGGACC 

GAGTTTACAGTAGAAAAAGGGGTAGCAGATAAAAACGGAAAGCGATTGAATGATTCTCTTTTTTGGAATTATGAT 

CAGACGAGTTTTACAACCAATACCAAAGATTATAGTTATTTAAAGCTGACTAATGATAAGAATGACATTGTAGAAT 
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TAAAAAATAAGGTACCTACCGAGGCAGAAGACCATGATGGAAATAGATTGATGTACCAATTCGGTGCCACTTTTA 
CTCAGAAAGCTTTGATGAAGGCAGATGAGATTTTGACACAACAAGCGAGACAAAATAGTCAAAAAGTCATTTTCC 
ATATTACGGATGGTGTCCCAACTATGTCGTATCCGATTAATTTTAATCATGCTACGTTTGCTCCATCATATCAAAAT 

CAACTAAATGCATTTTTTAGTAAATCTCCTAATAAAGATGGAAT^ 
5 GTGGAGAACATACAATTGTACGCGGAGATGGGCAAAGTTACCAGATGTTTACAGATAAGACAGTTTATGAAAAAG 
GTGCTCCTGCAGCTTTCCCAGTTAAACCTGAAAAATATTCTGAAATGAAGGCGGCTGGTTATGCAGTTATAGGCGA 
TCCAATTAATGGTGGATATATTTGGCTTAATTGGAGAGAGAGTATTCTGGCnTATCCGTTTAATTCTAATACT 
AAATTACCAATCATGGTGACCCTACAAGATGGTACTATAACGGGAATATTGCTCCTGATGGGTATGATGTCTTTAC 
GGTAGGTATTGGTATTAACGGAGATCCTGGTACGGATGAAGCAACGGCTACTAGTTTTATGCAAAGTATTTCTAGT 

10 AAACCTGAAAACTATACCAATGTTACTGACACGACAAAAATATTGGAACAGTTGAATCGTTATTTCCACACCATC 
GTAACTGAAAAGAAATCAATTGAGAATGGTACGATTACAGATCCGATGGGTGAGTTAATTGATTTGCAATTGGGC 
ACAGATGGAAGATTTGATCCAGCAGATTACACTTTAACTGCAAACGATGGTAGTCGCTTGGAGAATGGACAAGCT 
GTAGGTGGTCCACAAAATGATGGTGGTTTGTTAAAAAATGCAAAAGTGCTCTATGATACGACTGAGAAAAGGATT 
CGTGTAACAGGTCTGTACCITGGAACGGATGAAAAAGTTACGTTGACCTACAATGTTCGTTTGAATGATGAGTTTG 

15 TAAGCAATAAATTTTATGATACCAATGGTCGAACAACCTTACATCCTAAGGAAGTAGAACAGAACACAGTGCGCG 
ACrrCCCGATTCCTAAGATTCGTGATGTGCGGAAGTATCCAGAAATCACAATTTCAAAAGAGAAAAAACT^ 
ACATTGAGTTTATTAAGGTCAATAAAAATGATAAAAAACCACTGAGAGGTGCGGTCITTAGTCTTCAAAAACAAC 
ATCCGGATTATCCAGATATTTATGGAGCTATTGATCAAAATGGCACTTATCAAAATGTGAGAACAGGTGAAGATG 
GTAAGTTGACCnTTAAAAATCTGTCAGATGGGAAATATCGATTATTTGAAAATTCTGAACCAGCTGGTTATAAACC 

20 CGTTCAAAATAAGCCTATCGTTGCCTTCCAAATAGTAAATGGAGAAGTCAGAGATGTGACTTCAATCGTTCCACAA 
GATATACCAGCGGGTTACGAGTTTACGAATGATAAGCACTATATTACCAATGAACCTATTCCTCCAAAGAGAGAA 
TATCCTCGAACTGGTGGTATCGGAATGTTGCCATTCTATCTGATAGGTTGCATGATGATGGGAGGAGTTCTATTAT 
ACACACGGAAACATCCGTAA 

25 

4191.3 

ATGAAATCAATCAACAAATTTTTAACAATGCTTGCTGCCTTATTACTGACAGCGAGTAGCCTGTTTTCAGCT 
CAGTTTTTGCGGCTGGGACGACAACAACATCTGTTACCGTTCATAAACTATTGGCAACAGATGGGGATATGGATA 
30 AAATTGCAAATGAGTTAGAAACAGGTAACTATGCTGGTAATAAAGTGGGTGTTCTACCTGCAAATGCAAAAGAAA 
TTGCCGGTGTTATGTTCGTTTGGACAAATACTAATAATGAAATTATTGATGAAAATGGCCAAACTCTAGGAGTGAA 
TATTGATCCACAAACATTTAAACTCTCAGGGGCAATGCCGGCAACTGCAATGAAAAAATTAACAGAAGCTGAAGG 
AGCTAAATTTAACACGGCAAATTTACCAGCTGCTAAGTATAAAATTT 

GGTGAAGATGGAGCAACCTTAACAGGTTCTAAAGCAGTTCCAATTGAAATTGAATTACCATTGAACGATGTTGTG 

35 GATGCGCATGTGTATCCAAAAAATACAGAAGCAAAGCCAAAAATTGATAAAGATTTCAAAGGTAAAGCAAATCCA 
GATACACCACGTGTAGATAAAGATACACCTGTGAACCACCAAGTTGGAGATGTTGTAGAGTACGAAATTGTTACA 
AAAATTCCAGCACTTGCTAATTATGCAACAGCAAACTGGAGCGATAGAATGACTGAAGGTTTGGCATTCAACAAA 
GGTACAGTGAAAGTAACTGTTGATGATGTTGCACTTGAAGCAGGTGATTATGCTCTAACAGAAGTAGCAACTGGTT 
TTGATTTGAAATTAACAGATGCTGGTTTAGCTAAAGTGAATGACCAAAACGCTGAAAAAACTGTGAAAATCACTT 

40 ATTCGGCAACATTGAATGACAAAGCAATTGTAGAAGTACCAGAATCTAATGATGTAACATTTAACTATGGTAATA 
ATCCAGATCACGGGAATACTCCAAAGCCGAATAAGCCAAATGAAAACGGCGATTTGACATTGACCAAGACATGGG 
TTGATGCTACAGGTGCACCAATTCCGGCTGGAGCTGAAGCAACGTTCGATTTGGTTAATGCTCAGACTGGTAAAGT 
TGTACAAACTGTAACTTTGACAACAGACAAAAATACAGTTACTGTTAACGGATTGGATAAAAATACAGAATATAA 
ATTCGTTGAACGTAGTATAAAAGGGTATTCAGCAGATTATCAAGAAATCACTACAGCTGGAGAAATTGCTGTCAA 

45 GAACTGGAAAGACGAAAATCCAAAACCACTTGATCCAACAGAGCCAAAAGTTGTTACATATGGTAAAAAGTTTGT 
CAAAGTTAATGATAAAGATAATCGTTTAGCTGGGGCAGAATTTGTAATTGCAAATGCTGATAATGCTGGTCAATAT 
TTAGCACGTAAAGCAGATAAAGTGAGTCAAGAAGAGAAGCAGTTGGTTGTTACAACAAAGGATGCTTTAGATAGA 
GCAGTTGCTGCTTATAACGCTCTTACTGCACAACAACAAACTCAGCAAGAAAAAGAGAAAGTTGACAAAGCTCAA 
GCTGCTTATAATGCTGCTGTGATTGCTGCCAACAATGCATTTGAATGGGTGGCAGATAAGGACAATGAAAATGTTG 

50 TGAAATTAGTTTCTGATGCACAAGGTCGCTTTGAAATTACAGGCCTTCTTGCAGGTACATATTACTTAGAAGAAAC 
AAAACAGCCTGCTGGTTATGCATTACTAACTAGCCGTCAGAAATTTGAAGTCACTGCAACTTCTTATTCAGCGACT 
GGACAAGGCATTGAGTATACTGCTGGTTCAGGTAAAGATGACGCTACAAAAGTAGTCAACAAAAAAATCACTATC 
CCACAAACGGGTGGTATTGGTACAATTATCTTTGCTGTAGCGGGGGCTGCGATTATGGGTATTGCAGTGTACGCAT 
ATGTT A A A A A C A AC A A AG ATG AG G AT C A A CTTG CTT A A 

55 

4191.4 

ATGACAATGCAGAAAATGCAGAAAATGATTAGTCGTATCTTCTTTGTTATGGCTCTGTGTTTTTCTCTTGTATGGGG 

TGCACATGCAGTCCAAGCGCAAGAAGATCACACGTTGGTCTTGCAATTGGAGAACTATCAGGAGGTGGTTAGTCA 

ATTGCCATCTCGTGATGGTCATCGGTTGCAAGTATGGAAGTTGGATGATTCGTATTCCTATGATGATCGGGTGCAA 

60 ATTGTAAGAGACTTGCATTCGTGGGATGAGAATAAACTTTCTTCTTTCAAAAAGACTTCGTTTGAGATGACCTTCC 
TTGAGAATCAGATTGAAGTATCTCATATTCCAAATGGTCTTTACTATGTTCGCTCTATTATCCAGACGGATGCGGT 
TTCTTATCCAGCTGAATTTCTTTTTGAAATGACAGATCAAACGGTAGAGCCTTTGGTCATTGTAGCGAAAAAAACA 
GATACAATGACAACAAAGGTGAAGCTGATAAAGGTGGATCAAGACCACAATCGCTTGGAGGGTGTCGGCTTTAAA 
TTGGTATCAGTAGCAAGAGATGTTTCTGAAAAAGAGGTTCCCTTGATTGGAGAATACCGTTACAGTTCTTCTGGTC 

65 AAGTAGGGAGAACTCTCTATACTGATAAAAATGGAGAGATTTTTGTGACAAATCTTCCTCTTGGGAACTATCGTTT 
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CAAGGAGGTGGAGCCACTGGCAGGCTATGCTGTTACGACGCTGGATACGGATGTCCAGCTGGTAGATCATCAGCT 

GGTGACGATTACGGTTGTCAATCAGAAATTACCACGTGGCAATGTTGACTTTATGAAGGTGGATGGTCGGACCAA 

TACCTCTCTTCAAGGGGCAATGTTCAAAGTCATGAAAGAAGAAAGCGGACACTATACTCCTGTTCTTCAAAATGGT 

AAGGAAGTAGTTGTAACATCAGGGAAAGATGGTCGTTTCCGAGTGGAAGGTCTAGAGTATGGGACATACTATTTA 

TGGGAGCTCCAAGCTCCAACTGGTTATGTTCAATTAACATCGCCTGTTTCCTTTACAATCGGGAAAGATACTCGTA 

AGGAACTGGTAACAGTGGTTAAAAATAACAAGCGACCACGGATTGATGTGCCAGATACAGGGGAAGAAACCCTTG 

TATATCTTGATGCTTGTTGCCATTTTGTTGTTTGGTAG 

4191.5 

ATGAGCCACATATACTTATCTATTTTCACAAGTCTCTTGCTGATGCTAGGACTTGTCAATGTTGCTCAAGCCGATG 
AATATTTACGCATCGGTATGGAAGCAGCATATGCTCCCTTTAACTGGACCCAGGATGATGATAGCAACGGAGCTG 
TCAAAATCGATGGGACCAATCAGTATGCCAACGGATACGATGTTCAAATCGCCAAGAAAATCGCTAAGGACTTAG 
GTAAAGAACCTTTGGTTGTTAAAACCAAGTGGGAAGGTCT 

TATCGCAGGTATGAGTCCAACTGCAGAACGCAAACAAGAAATTGCCTTTTCGAGCAGTTACTATACTAGCGAACC 
AGTTTTGCTTGTCAAAAAAGATTCTGCCTACGCAAGTGCTAAATCTTTGGATGACT^ 

TCTCAACAAGGGGTCTACCTTTATAACTTGATTGCACAAATCCCAGGTGCTAAAAAAGAAACAGCCATGGGAGAC 

TTCGCTCAAATGCGACAAGCTCTTGAGGCTGGTGTCATTGATGCTTATGTTTCTGAACGTC 

CTGAAGCTGCGAACTCTAAGTTCAAGATGATTCAAGTAGAACCT 

TCGCTATCGGGCITCGTAAAAATGACAATCGTATTAGCCAAATCAATGCCAGCATTGAAACCATTTCAAi^ 

ACCAAGTTGCCTTGATGGATCGTATGATCAAGGAACAACCTGCCGAAGCTACAACAACTGAAGAGACTAGCAGTA 

GTTTCrTTTAGCCAAGTTGCTAAAATTCTTTCTGAAAACTGGCAACAACTCTTG 

ATCTCTATCGTCGGAACCATCATAGGTCTCATTATTGGACTTGCCATTGGTGTCITCCGTACTGCrCCTCT 

AAACAAAGTCATTTACGGCCTACAAAAACTAGTCGGCTGGGTTCTCAA 

CCAATGATTGTTCAATCGATGGTTATCTACTATGGAACTGCCCAAGCTTTCGGGATCAACC 

TTGACCGTACACTGGCTGCTATCITCATCGTTTCAATCAATACCGGTGCCTACATGACTGAAATCGTCCGTGGTGG 
TATCCTAGCAGTTGACAAGGGACAATTTGAAGCTGCGACTGCTCTTGGTATGACCCATAACCAGACCATGCGTAA 
GATTGTCCTACCTCAGGTAGTCCGTAACATCCTACCTGCAACTGGTAATGAATTTGTCATCAATATCAAAGATACA 
TCTGTATTGAACGTTATCTCTGTTGTCGAACTTTATTTCT 

CCAGACATTTACAATCATCGCCGTGATTTACTTTGTCCTCACCTTCACCGTAACACGTATCCTACGCTTTATCGAGC 
GCAGAATGGACATGGATACCTACACTACAGGTGCTAACCAAATGCAAACGGAGGATTTGAAATAA 

4191.6 

ATGACACAAGCAATCCTTGAAATTAAACACCTCAAAAAATCCTATGGACAAAACGAAGTGCTAAAAGA 
CTCACTGTCCACAAGGGAGAGGTCATCTCTATCATCGGAAGCTCTGGAAGCGGAAAATCGACCTTCCTACGCTCC 
ATTAACCTACITGAAACACCAACTGATGGACAAATCCTTTATCATGGACAAAACGTCCTCGAAAAAGGCTATGAC 
CTCACGCAATACCGTGAAAAGTTGGGGATGGTTTTCCAATCCTTTAACCT 

ACACAATCGTCGCTCAGACAACTGTCCTAAAACGCGAACGCACAGAAGCTGAAAAGATTGCCAAAGAAAACCTG 

GAAAAGGTCGGCATGGGAGAACGCTACTGGCAAGCCAAACCAAAACAACTCTCAGGTGGTCAAAAACAACGTGT 

GGCCATCGCTCGTGCCCTCTCCATGAATCCGGACGCTATTCTCTTTGATGAACCAACATCAGCTCTCGATCCAGAA 

ATGGTTGGAGAAGTCCTCAAAATCATGCAGGACCTGGCTCAGGAAGGCTTGACCATGATTGTCGTAACCCATGAA 

ATGGAATTTGCCCGTGATGTCTCTCACCGTGTTATCTTTATGGATAAGGGCGTGATCGCTGAAGAAGGTAAACCAG 

AAGACCTCTTCACCAATCCTAAAGAAGACCGAACAAAAGAGTTCCTTCAACGCTATCTCAAATAA 

4192.3 

ATGAAAAAGTATCAACTTCTATTCAAAATAAGTGCAGTCTTCTCT^ 

GACGCTTATCGTCCAAAACTATTGGCAATTTTCTTCTCAGATAGGCAATTTATTCTGGATTCA 

TACTTTTTATTGGAGTCATGATTGTGGTTCTTGTTAAGACAGGCCATGGTTATCTCTTCCGCATT 

ATGGCTTTGGTATTCGATTTTGACAGTATTAGTGCTAGTGTTCCAGATCTCTTTTAACGTTCAGACAGCT 

TTCAGTCAACTGCGGAAGGTTGGGCTGTATTGATTGGTTATAGTGGGACTAACITTGCAGAGCTAGGTATTTA^ 

AGCCCTGTTCTTTCTGGTTCCACTGATGGAAGAATTGA 

CGCGATTTGGTCTTGATTTGCTTCTTCCTTCTATTTTATTTGCTCT 

ATATCTTCGTCTTTGCAACAGTTGGAATCATCnTTGCTGGTTTGACCCGCTATACCAAGAGCATTTATCCATCCTAT 
GCGGTGCATGTGATCAATAATATTGTAGCGACCTTCCCGTTTTTGCTC 

4193.1 

ATGAACAAGAAACAATGGCTAGGTCTTGGCCTAGTTGCAGTGGCAGCAGTTGGACTTGCTGCATGTGGTAACCGC 

TCnTCTCGTAACGCAGCTTCATCTTCTGATGTGAAGACAAAAGCAGCAATCGTCACTGATACTGGTGGTGTTGATG 

ACAAATCATTCAACCAATCAGCTTGGGAAGGTTTGCAGGCTTGGGGTAAAGAACACAATCTTTCAAAAGATAACG 

GTTTCACTTACTTCCAATCAACAAGTGAAGCTGACTACGCTAACAACTTGCAACAAGCGGCTGGAAGTTACAACCT 

AATCTTCGGTGTTGGTTTTGCCCTTAATAATGCAGTTAAAGATGCAGCAAAAGAACACACTGACTTGAACTATGTC 

TTGATTGATGATGTGATTAAAGACCAAAAGAATGTTGCGAGCGTAACTTTCGCTGATAATGAGTCAGGTTACcTTG 

CAGGTGTGGCTGCAGCAAAAACAACTAAGACAAAACAAGTTGGTTTTGTAGGTGGTATCGAATCTGAAGTTATCT 

CTCGTTTTGAAGCAGGATTCAAGGCTGGTGTTGCGTCAGTAGACCCATCTATCAAAGTCCAAGTTGACTACGCTGG 

TTCATTTGGTGATGCGGCTAAAGGTAAAACAATTGCAGCCGCACAATACGCAGCCGGTGCAGATATTGTTTACCA 

AGTAGCTGGTGGTACAGGTGCAGGTGTCTTTGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCTGAAAATGAAAA 
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AGTTTGGGTTATCGGTGTTGATCGTGACCAAGAAGCAGAAGGTAAATACACTTCTAAAGATGGCAAAGAATCAAA 
CTTTGTTCTTGTATCTACTTTGAAACAAGTTGGTACAACTGTAAAAGATATTTCTAACAAGGCAGAAAGAGGAGAA 
TTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACTTGGCAGTAACAAACCTTTCAGAAG 
AAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAAAATCCTTGATGGAAGCGTAAAAGT 
5 TCCTGAAAAATAA 

4193.3 

ATGTCTAAAAAATTACAACAAATTTCGGTTCCCTTGATTTCTGTATTCCTAGGAATTTTACT 
GTGGATCrrCGGTTATGATGCTATTTGGGGCTACGAAGAATTGTTCTATACAGCCTTTGGCAGTCT 

10 GGAGAAATCTTCCGTGCTATGGGTCCTCTGGTCTTGATTGGTCTTGGTTTTGCCGTTGCCAGTCGAGCTGG 

TAACGTCGGACTTCCTGGTCAGGCTTTGGCAGGTTGGATTCTCAGTGGTTGGTTTGCCCTGTCGCATCCAGATATG 
CCCCGTCCCTTGATGATTCTAGCAACCATCGTGATTGCCTTGATTGCTGGTGGGATTGTCGGAGCGATTCCAGGTA 
TTCTTAGGGCCTATCTAGGGACGTCAGAGGTTATTGTAACCATCATGATGAACTACATTGTCTTGTATGTAGGGAA 
TGCCTTTATCCATGCTTTCCCTAAAGACTTCATGCAAAGTACAGATTCGACCATTCGTGTT 

15 TATCAGACACCTTGGTTGGCTGAGTTGACTGGTAACTCACGGATGA 

TGCAGTTATTTGGTTCATGCTCAAGAAAACAACTCITGGTTTTGAAATCCGTC 
CAGAATATGCrGGTATTTCTGCCAAGCGGACTATTATCCTATC 
GGAGCTGTTGAAGGTTTGGGAACCTTCCAGAACGTCTATGTrCAAGGTTCGTC 
TGGCGGTTAGTTTGCTTGCGGCCAACTCACCAATTGGTATACTCTTTGCAGCCITCCTATTTGGC 

20 GGGGCTCCTGGTATGAATGCGGCGCAGGTACCATCTGAGCTTGTCAGCATTGTAACAGCGTCT^ 
TCAGTGTTCATTACCTTATCGAACGCTTTGTCAAACCGAAAAAACAAGTTAAAGGAGGTAAGTAA 

4194.1 

ATGGGAGTGAAAAAGAAACTAAAGTTGACTAGTTTGCTAGGACTGTCTCTGTTAATCATGACAGCCTGTGCGACT 

25 AATGGGGTAACTAGCGATATTACAGCCGAATCGGCTGATTTTTGG 

TTCG CTTTTT ATCGTTTG AT ATT AGT ATCG G AGTG G G G ATT ATTCTCTTT ACGGTCTTG ATTCGT AC AGTCCTCTTG 
CCAGTCTITCAGGTGCAAATGGTGGCTTCTAGGAAAATGCAGGAAGCTCAGCCACGCATrAAGGCGCTTCGAGAA 
CAATATCCAGGTCGAGATATGGAAAGCAGAACCAAACTAGAGCAGGAAATGCGTAAAGTATTTAAAGAAATGGG 
TGTCAGACAGTCAGACTCTCTTTGGCCGATTTTGATTCAGATGCCGGT^ 

30 GAGTTGACTTTTTAAAGACAGGTCATTTCTTATGGATTAACC^ 

TTAGCAGCAGTATTCACCTTTTTAAGTACTTGGTTGTCCAACAAAGCnTTGTCTGAGCGA^ 
CGATGATGTATGGGATTCCAGTCTTGATTTTTATCTTTGCAGTTTATGCGCCAGGTGGAGTCGCCCTATACTG 
AGTGTCTAATGCTTATCAAGTCITGCAAACCTATTTCTTGAATAATCCATTCAAGATTATCGCAG 
GTAGTACAGGCACAAAAAGATTTGGAAAATAGAAAAAGAAAAGCCAAGAAAAAGGCTCAGAAAACGAAATAA 

35 

4194.4 

ATGGTT ATCG ATCCATTTGCTATCAACGAACTAGACT ATT ACTTAGTTTCACACTTCCACAGT 

CATACACAGCTGCAGCAATTCTCAATAATCCTAAGTTAGAGCATGTTAAGTTTATCGGTCCTTACCACTGTGGACG 
AATCTGGGAAGGATGGGGTGTTCCAAAAGAACGTATCATCGTTGTTAAACCAGGTGACACTATCGAATTAAAAGA 
40 TATGAAGATTCATGCAGTAGAATCATTTGACCGTACTTGCTTGGTAACrCTCCCAGTGAACGGTGCTGATGAGACA 
GGCGGTGAACITGCTGGCnTGGCTGTTACAGATGAAGAAATGGCrCAAAAGGCrGTTAACrATA^ 
CAGGTGGAACCATCTATCATGGTGCAGATTCTCACTTCTCAAACTATTTTGCAAAA 

TGATGTTGCTTTGAATAACTATGGTGAAAATCCGGTAGGTATCCAAGACAAAATGACATCTATCGACCTTCTTCGT 
ATGGCAGAAAATCTGCGTACCAAAGTCATTATCCCAGTTCACTATGATATCTGGTCTAACTTCATGGCTTCTACTA 
45 ATGAGATTCTAGAACTTTGGAAAATGCGAAAAGATCGCTTGCAATACGATTTCCATCCATTTATCTGGGAA 

CGGTAAGTACACTTATCCTCAAGATCAACACTTAGTAGAATACCATCATCCACGTGGTTITGATGATTGTTTTGAA 
C AAG ACTCT A A C ATTC AATTT AA AG CTTTG CT AT A A 



50 

4196.2 

ATGTTCCTTTCAGGCTGGTTGTCTAGTTTTGCTAATACTTATATCCATGATTTACT 
TCCATTTTTAAATGCCTTTGAAAGTGCTATTGCGGCrCCTTTGGTAGAAGAACCCT^ 
55 TTGTTITGGCITTGATTCCTGTGCGAAAATTAAAATCTTTGTTT^ 

ATGATTAAGGATATTGGTTATATTCGTACGGATTTGCCAGAGGGCTTTGACTTTACTATTTCGCGA 
GTATCATCTCAGGAATTGCCTCTCACTGGACTTTTTCAGGTCTAGCTGTAGTAGGTC 

TATAAAGGACAGAAGGTTGGCAAGAAACAGGGCCTTATITTTCTAGGTTTAGCCTTGGGAACTCACTTCTTG 
ACTCTCCTTTTGTGGAGTTGGAAACAGAGTTGCCTTTAGCGATTCCAGTGGTTACGGCTATTGCTCTC 
60 TATCATGCTTATTGCTTTGTTGAGAAACACAATGAGTTGATGACCTAG 

4197.1 

ATGAAGGTGGAACCACGTTGCGACGTCCTTTCGAGGATGTCGCATTTTTTTATC 

AAGAATTAGTGGAGCGCAGTTGGGCAATCCGACAAGCTTATCACGAACTGGAAGTTAAGCATCATGATTCCAAGT 
65 GGACGGTAGAAGAAGACCTCTTGGCTTTATCTAATGATATTGGAAATTTCCAACGACTGGTGATGACAAAGCAAG 
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GACGCTACTATGATGAAACACCCTACACACTGGAACAAAAACTTTCAGAAAATATCTGGTGGCTATTAGAACTTT 
CTCAACGTTTGGATATAGACATTCTGACGGAAATGGAAAACTTCCTCTCTGATAAAGAAAAGCAATTGAACGTTA 
G G A CTTG G A AGT AG 

5 4197.4 

ATG CTTG ATTG G AAAC AATTTTTTCT AG CCT ATCTG CG CT C CCGT AGTCGTCTTTTT ATCT ATCTG CTTTCTTTGGC 
ATTTCTTGTCTTACTCTTTCAGTTTTTATTTGCCAGTCTAGGAATTTA 

GTAACCATATTATTTTTCACTTGGGACATATTGGTGGAAACGCAGGTCTATCGCCAGGAACTITCTCTATGGAGAGA 
GGGAAGCCAAGTCTCCTTTGGAAATAGCTTTAGCAGAAAAATTAGAAGCGCGTGAGATGGAACTCTATCAGCAGA 
10 GGTCAAAAGCAGAAAGAAAACTGACGGATTTGCTGGATTACTATACCTTGTGGGTCCATCAGATAAAGACCCCCA 
TTGCAGCCAGTCAACTCTTAGTTGCAGAAGTGGTCGACCGCCAACTGAAGCAGCAGCTAGAACAGGAAATTTTCA 
AAATCGACTCCTATACCAACCTAGTTTTACAGTACCTGCGTTTAGAAAGTTC 
GGTTCAAATTGAGGACTTGGTCAAGGAAATAATTCGTAAATATGCTCnTTTCI^ 

AATCTACATGACC1TGATAAAGAAATCGTGACGGATAAAAAGTGGCTGCTAGTGGTTATTGAGCAAATCATCTCA 
15 AACAGTCTCAAGTACACCAAGGAAGGTGGTCTGGAGATTTATATGGATGACCAAGAGCTTTGTATCAAAGATACG 
GGAATCGGGATAAAAAACAGTGATGTCCTCCGAGTATTTGAACGTGGCITTTCAGGATACAATGGCCGTTTGACCC 
AGCAGTCCTCTGGACTTGGCCITTATCTATCTAAGAAAATTTCTGAAGA 

TGAGGTCGGAAAAGGAACGACAGTGCGGATTCAGTTTGCTCAAGTGAACTTAGTCCTTGAGTAA 
20 4211.2 

ATGGAACTTAATACACACAATGCTGAAATCTTGCTCAGTGCAGCTAATAAGTCCCACTATCCGCAGGATGAACTG 
CCAGAGATTGCCCTAGCAGGGCGTTCAAATGTTGGTAAATCCAGCTTTATCAACACTATGTTGAACCGTAAGAATC 
TCGCCCGTACATCAGGAAAACCTGGTAAAACCCAGCTCCTGAACTTTTTTAACATTGATGACAAGATGCGCTTTGT 
GGATGTGCCTGGTTATGGCTATGCTCGTGTTTCTAAAAAGGAACGTGAAAAGTGGGGGTGCATGATTGAGGAGTA 
25 CTTAACGACTCGGGAAAATCTCCGTGCGGTTGTCAGTCTAGTTGACCTTCGTCATGACCCGTCAGCAGATGATGTG 
CAGATGTACGAATTTCTCAAGTATTATGAGATTCCAGTCATCATTGTGGCGACCAAGGCGGACAAGATTCCTCGTG 
GTAAATGGAACAAGCATGAATCAGCAATCAAAAAGAAATTAAACITTGACCCGAGTGAC^ 
ATCTGTCAGTAAGGCAGGGATGGATGAGGCTTGGGATGCAATCTTAGAAAAATTGTGA 

30 4211.3 

ATGACAAAGAAACAACTTCACTTGGTGATTGTGACAGGGATGAGTGGCGCAGGGAAAACTGTAGCCATTCAGTCC 
TTCGAGGATCTAGGTTATTTCACCATTGATAATATGCCGCCAGCTCTCTTGCCTAAGTTTTTGCAGCT 
TAAGGAAGACAATCCTAAGTTGGCCTTGGTAGTGGATATGCGTAGCCGTTCTITCTTTTCAGAGATTCA^ 
TTGGATGAGTTGGAAAATCAAGATGGTTTGGATTTCAAAATCCTCTTTTTGGATGCGGCT^ 

35 CTCGTTACAAGGAAACCAGACGGAGTCACCCACTAGCAGCAGACGGTCGTATTTTAGATGGAATCAAGTTGGAAC 
GTGAACTCTTGGCACCTTTGAAAAATATGAGCCAAAATGTGGTGGATACGACTGAACTCACTCCACGTGA 
CAAAACCCITGCAGAGCAGTTTTCAGACCAAGAACAAGCCCAGTCTTTCCGTATCGAAGTCATGTCTTTCGGATTT 
AAGTATGGAATCCCGATTGATGCGGACTTGGTCTTTGATGTCCGTTTCTTGCCAAATCCCTATTAT^ 
GAGAAACCAAACGGGTGTGGATGAACCTGTTTATGATTATGTCATGAACCATCCTGAGTCAGAAGACTTTTATCAA 

40 CATTTATTGGCCTTGATTGAGCCGATTCTGCCAAGTTACCAAAAGGAAGGTAAGTCCGTTTTGACCATTGCC 

GATGTACGGGTGGACAACACCGTAGTGTGGCATTTGCTAAACGCTTGGCGCAGGACTTATCCAAGAATTGGTCTGT 
TAATGAAGGGCATCGCGACAAAGACCGCAGAAAGGAAACGGTAAACCGTTCATGA 

4211.4 

45 ATGAGAAAACCAAAGATAACGGTGATTGGTGGAGGGACTGGAAGTCCCGTCATTCTAAAAAGTCTGCGGGAAAAA 
GATGTGGAAATCGCAGCTATCGTGACGGTGGCAGATGATGGTGGTTCTTCAGGTGAACTCCGAAAAAATATGCAA 
CAGTTGACACCGCCAGGTGATCTTCGTAATGTCCTTGTGGCCATGTCGGATATGCCTAAGTTTTATGAGAAGGTCT 
TTCAGTATCGGTTCTCTGAGGATGCCGGAGCCTTTGCTGGCCATCCATTGGGAAATCT^ 

AGAAATGCAGGGTTCAACCTATAATGCCATGCAGTTATTGAGCAAATTTTTCCATACAACAGGGAAAATTTATCCT 
50 TCCAGTGACCATCCTTTGACCCTTCATGCAGTCTTTCAGGATGGGACAGAAGTGGCTGGAGAGAGTCATATTGTAG 
ACCATCGAGGCATAATTGACAATGTCTATGTGACCAATGCCCTAAACGATGATACGCCTCTGGCCAGCCGTCGAG 
TAGTGCAGACCATCCrTGAAAGTGACATGATTGTCCTAGGGCCAGGTTCCCTCTTTACCTCTATTTTGCCCAATAT 
CGTGATTAAGGAAATTGGGCGGGCTCITTTGGAAACCAAGGCAGAAATTGCCTATGTCTGCAATATCATGACCCA 
ACGTGGGGAGACGGAACACTITACAGATAGCGACCACGTGGAAGTCTTGCATCGTCACCTTGGTCGCCCTTTTATC 
55 GACACTGTCnTGGTGAATATTGAAAAAGTGCCTCAGGAATACATGAATTCCAACCGTTTTGATGAATACTTAGTGC 
AAGTGGAACACGATTTTGTAGGTCTTTGTAAGCAAGTTTCGCGCGTGA 

TGGCGGTGCCTTCCACGATGGAGATTTGATTGTGGACGAGTTGATGCGCATTATACAGGTGAAAAAATGA 
4213.1 

60 ATGAAAAATTTGATAAAGrTGCTAATAATTAGATTGATTGTTAACTTAGCAGACAGTGTATTTTATATAGTAGCAT 
TGTGGCACGTTAGCAATAATTATTCTTCGAGCATGTTCTTAGGAATATTTATTGCAGTAAATTATCTACCGGATTTG 
TTACTAATCTTTTTTGGACCAGTTATTGACAGAGTAAATCCGCAAAAAAT^ 

AGCAGTGGCTGTAATATTTTTATTATTATTAAACCAAATATCATTTTGGGTGATAATGAGTCTAGTGTTTATTT 
TAATGGCTAGCTCCATAAGTTACGTGATAGAAGATGTGTTGATTCCTCAAGTGGTAGAATATGATAAGATTGTATT 
65 TGCAAATTCTCTTTTTAGTATlTCGTATAAAGTATTAGATTCTATrnTAATTCATT 
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CAGTAGGATTTATTTTATTGGTTAAGATAGATATAGGCATATTT 

TTTAGAACTAGCAATGCGAATATAGAAAACTTCTCTTTCAAATATTACAAGAGAGAAGTGTTGCAAGGTACAAAG 

TTTATTTTAAATAATAAATTATTATTTAAAACCAGTATTTC 

GTAGTTGTACCGATTTTTTCTATTCGATATTTTGATGGTCCGATT^ 

GGTGGTATATTGGGAAATATGCTAGCGCCAATCGTAATAAAATATTTAAAATCGAATCAAATTGTTGGTGTATTTC 
TTTTTTTGAACGGCTCAAGTTGGTTAGTAGCAATTGTTATAAAAG^ 

GTTTTATGTCTAAAGGAGTCTTCAATATTATTTTTAATTCGTTGTACCAACAAATACCTCCAC 
AGGGTAAATACTACCATTGATTCTATTATTTCTTTTGGAATC^ 
TTTGAATATTGAATTAGTGTTAATTGCTATTAGCATACCTTATTTTTTGTTT^ 
ATTGAAAGAATTTAGTATATATTAG 

4213.2 

ATGATGTCTAACAAAAATAAGGAAATTCTGATTTTTGCGATTCT 
ATTGCTGGCTTCTTTAATGCCATCTGCCATTGCAAATTATCTTGTTTATGTAGTTTr 

GTTCAAGGATAGATTGATCCAACAATGGAAGGAGATTAGAAAGACTAAAAGAAAATTCTTCTTTGGAGTCTTAAC 
AGGATGGCTCTTTCTCATTCTGATGACTGTTGTCTTTGAATTTGTATCAGAGATGTTGAAGCAGT^ 
ATGGACAAGGTCTAAATCAGTCTAATATTCAAAGTACCTTTCAAGAACAACCACTACTGATAGCTGTTTTTGCTTG 
TGTCATTGGACCTCTGGTAGAAGAATTATTTTTCCGTCAGGTCTT^ 

TACTAAGCATTATTCTGGTAGGACTTGTTTTTGCTCTGACTCATATGCACAGTTTGGCTCTATCAGAGTGGATTGGT 

GCAGTTGGTTACTTAGGTGGAGGCCTTGCCTTTTCTATTATTTATGTGAAAGAAAAAGAGAATATCTACT 

TACTTGTTCACATGTTAAGCAACAGCCTCTCCTTAATCATTTTAGCTATCAGTATAGTAAAATGA 



4224.1 

TTGAAAAAGCCAATTATCGAATTCAAAAACGTCTCTAAAGTTTTTGAAGACAGCAACACCAAGGTTCTCAAAGAC 

ATCAACTTTGAGTTGGAAGAAGGGAAATTCTACACCCTTCTAGGTGCATCTGGTTCGGGGAAATCAACTATCCTAA 

ACATTATTGCAGGTTTACTGGATGCGACGACAGGAGATATCATGCTAGACGGTGTTCGTATCAATGATATTCCAAC 

CAACAAGCGCGACGTACATACCGTCTTCCAATCCTATGCCTTGTTCCCACATATGAATGTGTTTGAAAATGTTGCC 

TTTCCACTTCGCITGCGTAAAATTGATAAGAAAGAAATCGAGCAGCGTGTAGCGGAAGTTCTCAAGATGGTTCAGT 

TGGAAGGTTATGAAAAACGTTCCATCCGCAAACTTTCTGGAGGACAACGTCAGCGTGTGGCCATCGCCCGTGCTA 

TCATCAACCAACCCCGTGTGGTCTTGTTGGACGAGCCTTTATCAGCGCTGGACTTGAAATTGAGAACAGACATGCA 

GTACGAATTGCGTGAATTACAACAACGATTGGGCATTACCTTTGTCTTTGTCACTCACGATCAGGAAGAAGCTCT^ 

GCCATGAGTGACTGGATTTTCGTTATGAATGATGGCGAGATTGTCCAGTCTGGAACCCCTGTGGACATCTACGATG 

AGCCAATCAACCACTTTGTTGCCACCTTTATCGGGGAGTCAAACATCTTGCCAGGTACCATGATTGAGGACTACTT 

GGTCGAATTTAACGGCAAACGCTTTGAAGCGGTTGATGGTGGGATGAAGCCAAATGAACCTGTTGAGGTCGTTAT 

TCGTCCAGAGGACTTGCGCATTACCCTTCCTGAAGAAGGCAAGCTCCAAGTTAAGGTCGATACCCAGCTTTTCCGT 

GGAGTTCATTATGAAATTATCGCCTATGACGAACTTGGAAATGAATGGATGATCCACTCAACCCGTAAGGCTATCG 

TGGGTGAGGAAATCGGTCTGGACTTTGAACCAGAAGACATCCACATCATGCGTCTCAATGAAACCGAAGAAGAGT 

TCGATGCTCGTATTGAGGAGTACGTAGAAATCGAAGAGCAAGAAGCAGGTTTGATCAATGCAATCGAGGAGGAAA 

GAGATGAAGAAAACAAGCTCTAA 

4252.1 

ATGAAATCAATGAGAATCTTATTTTTGTTAGCTTTAATTCA 
GCATCITGTCCTTTAAACAAAGTACAGCTTTTTTCATCG 

AAATTATCTTTATACCCGTAAGCAAGAAGTCCATAGTGTCCTAGCCAGTAAGAAGTCGGTGAAGCTTTTTTACAGT 

ATGTTACTCITAATTAATTTGTTAGGAGCTGTTCTTGT^ 

AGAATTAGTTGACTTTTTATTGCCATCCTTCTTTTC 

ATACGTGCGCGATTTTCTTGCTATGCTGGACAGAAAAAAGACAGTGTTGGTGACTATTTTAGCAACACTT 
TTAAGAAATCCAATGACCATTGTCTCACTTCTGATTTATATTGGACTGGGCTTGTTTTTTGCAGC 
AAATTCGGTTAAGAAGGAAGTTTCCTTTTATGGTCATATTTTCCGAGATCTTGTATTGGTCATTGTTACGCTCAT^ 
TCTTTTAG 

4252.2 

ATGGTT A A AA A AATT ATTGG A ATG GTG CT AG CTTT A CTTTCTGT A ACTGT AGT AGG AGT AGGTGTTTTTG CTT AT AC 

TATTTATCAACAAGGGACAGAAACCTTAGCTAAAACCTATAAAAAAATCGGTGAAGAAACCAAGGTTAT^ 

GACTGAACCTCTAACCATTCTGTTAATGGGAGTGGACACCGGAAATGTTGAACGAACTGAAACTTGGGTCGGTAG 

AAGTGATAGCATGATCTTGATGACAGTGAATCCTAAAACGAAAAAAACAACAATGATGAGTTTAGAGCGGGATAT 

TCTGACGCGCATTGAATCAGGGAATGGTCAGGCTCATGAAGCGAAACTGAACTCAGCATATGCAGATGGTGGAGC 

AGAGCTTGCTATAGAAACCATTCAAAAAATGATGAATATCCATATTGATCGCTATGTGATGGTCAATATGAGAGG 

ATTGCAAAAACTAGTGGATGCAGTAGGAGGTATTACAGTCAATAATATCCTAGGTTTCCCAATTTCTATCAGTGAC 

CAAGAAGAATTTAATACTATTTCTATCGGTGTTGGGGAGCAACATATTGGGGGAGAAGAAGCCCTAGTCTATGCA 

CGAATGCGTTACCAAGATCCTGAGGGGGATTATGGTCGTCAAAAACGTCAACGTGAAGTTATTCAAAAAGTCATG 

GAAAAAGCTCTCAGTTTAAATAGCATTGGTCATTATCAAGAGATTCTAAAAGCTTTGAGTGACAATATGCAGACC 
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AATATTGATTTGTCTGCAAAAAGTATCCCTAACTTGCTAGGCTATAAAGATTCATTTAAAACCATTGAAACTCAGC 
AGTTGCAGGGTGAAGGAGAGATACTTCAAGGTGTTTCTTACCAGATTGTTTCGAGAGCACATATGTTGGAAATGCA 
AAATCTACTCCGACGTTCTTTGGGACAAGAAGAAGTTACTCAGCTTGA 
TTTGGCAGAGCACCTGTTGGTGATGAAGATAATTAA 

5 

4256.2 

ATGAAAAAACAAGCCTATGTCATTATTGCTCTCACCTCCTTCCTATTTGTCTTTTTTTTCTCCCACA 
AATACTTGATTTTGACTGGTCTATCITTTTGCACGATGTC^ 

TCAGCATGTCCATGACCTGTCTCTTAGCCCTGTTTTGGCGAGGGATCGAAGAGCTTTCTCTAAGAAAAATGCAGGC 
10 TAATCTCAAGCGTTTATTAGCAGGGCAAGAAGTGGTTCAGGTTGCAGATCCAGATTTGGATGCCAGTTTCAAGTCC 
TTATCAGGTAAACTTAACCTTTTGACAGAGGCTCTTCAAAAAGCTGAAAATCAGAGCCTTGCTCAGGAAGAGGAA 
ATCATCGAGAAGGAACGGAAGCGAATTGCTCGGGATTTGCACGATACAGTCAGTCAGGAGTTGTTTGCGGCCCAC 
ATGATTTTATCGGGTATCAGTCAGCAGGCTTTGAAATTGGATAGAGAAAAGATGCAGACCCAGTTGCAGAGTGTC 
ACAGCTATTTTAGAAACAGCCCAGAAGGATTTGCGGGTTTTGCTCTTGCATTTGCGACCAGTTGAACTG 
15 AGAGCnTGATAGAAGGGATTCAAATTCTTTTAAAAGAGCTTGAGGACAAGAG 

AGAATATGACGAAATTGCCTAAGAAAATCGAGGAGCATATCTTCCGTATCCTGCAAGAGTTGATTAGCAATACCC 
TCCGCCATGCCCAGGCATCTTGCCTAGATGTCTACCTCTATCAGACAGATGTTGAATTGCAACTGAAGGTGGTGGA 
CAATGGGATTGGTTTCCAGTTAGGGAGCTTAGACGACTTGAGTTATGGACTGCGAAATATCAAGGAGCGGGTTGA 
AGATATGGCTGGAACAGTTCAACTCTTGACAGCTCCCAAGCAAGGGCTGGCGGTTGATATCCGTATTCCCCTGTTA 
20 GATAAGGAATGA 

4263.1 

ATGATTGTTTCCATTATTTCTCAAGGATTTGTCTGGGCTATTCTAGGTCTGGGAATCTTTATGACATTT 
AAACTTTCCAGATATGACGACAGAAGGTTCCTTCCCTCTTGGGGGAGCTGTTGCTGTCACTTTGATAACCAAAGGC 
25 GTGAACCCATTTTTAGCGACACTTGTTGCTGTAGGAGCAGGTTGTTTG 

CAAAAGGGAAGATCCCAACCTTGCTCTCAGGGATTTTGGTGATGACTTCTTGTCACTCAATCATGCTC^ 

GGGACGTGCGAATTTAGGCCTGCTTGGAACCAAGCAAATTCAGGATGTTTTGCCT^ 

CTCTTGACAGGTCTCATCTTTGTGAGTATTGTTATTGCT 

CTATATTGCTACAGGGGATAATCCTGATATGGCTAGAAGTTTCGGGATTCATACTGGACGCATGGAGCTCATGGGC 
30 TTGGTCTTATCAAATGGTGTGATTGCCCTTGCAGGTGCCCTCATTGCTCAGCAAGAAGGTTATGCCGATGTGTCTC 
GAGGGATCGGGGTTATCGTTGTGGGGCTTGCAAGTTTGATTATTGGAGAAGTTATTTTCAAGAGTTTGAGCT 
AGAGCGTTTGGTTACTATCGTTGTAGGTTCTATCGCTTATCAATTTTTAGTGTGG 

ATACAAGTTACCTTCGTTTATACAGTGCCTTGATTTTAGCAGTCTGCCTCATGATTCCAACATTTAAGCAAACAAT 
CTTGAAAGGAGCCAAGTTAAGCAAATGA 

35 

4346.1 

ATGAAAAAAATGAAAGTTTGGTCTACTGTA(HTGCAACGGGAGTTGCTC 

GTTCAAATTCTACGACTGCTTCTTCATCTGAAGAAAAAGCTGATAAAAGTCAAGAATTAGTTATCTATTCGAACTC 
AGTCTCAAATGGTCGTGGTGATTGGTTAACTGCTAAAGCAAAAGAAGCTGGTTTTAATATAAAAATGGTTGATATC 

40 GCTGGCGCTCAATTAGCAGACCGTGTTATTGCTGAGAAGAATAATGCAGTTGCAGATATGGTATTTGGAATTGGTG 
CTGTTGATTCAAATAAAATTAGAGATCAAAAATTACTAGTACAGTACAAGCCTAAATGGTTAGATAAAATTGATC 
AATCTTTATCAGATAAAGATAATTATTATAATCCTGTGATTGTTCAACCATTAGTTTTAATTGGGGCGCCT 
AAAGAAATGCCTAAAGATTGGACTGAATTAGGTAGTAAGTATAAAGGTAAATATTCAATTTCTGGTCTTCAAGGA 
GGTACAGGACGGGCAATTCTAGCAAGTATCTTAGTTCGATACCTTGATGATAAAGGTGAATTAGGTGTTTCCGAAA 

45 AAGGTTGGGAAGTAGCAAAAGAATATTTGAAAAATGCATACACTCTTCAAAAGGGAGAAAGTTCAATTGTTAAGA 
TGTT AG AC AA AG AAG ATC C AAT AC AAT ATG G AATG ATGTGG G GTTCTG GTG C ATT AGTTG G AC AAAAAG AAC AAA 
ATGTTGTTITCAAAGTTATGACTCCTGAGATTGGTGTACCATTTGTAACTGAACAAA 

TAAAAAACAAGCGTTAGCTAAAGAATTTATTGATTGGTTTGGTCAATCAGAAATTCAAGTAGAATATAGTAAGAA 
CTTTG G ATCT ATTC CTG C AAAT AAAG ATG CC CT C AAAG ATCT A C CTG A AG AT A CG A AG A AATTTGTTG ATC AAGTG 
50 AAACCACAAAATATTGACTGGGAAGCTGTTGGAAAGCATTTGGATGAATGGGTAGAAAAAGCTGAATTAGAATAC 
GTACAATAA 

4346.2 

ATGATTAAATTTGATAATATTCAAATTAAATATGGTGATTTTGTTGCAATTGATAATCTGAATTTAGATATACATG 
55 AAGGGGAATTTTTTACATTTCTTGGGCCTTCAGGATGTGGTAAATCAACT 

AGATCCATCATCAGGAAGTATTGAAGTTAATGGAACAGATGTCACTCATTTGGAACCTGAAAAGCGTGGAATTGG 
TATTGTATTTCAATCTTATGCGCTATTTCCAACTATGACTGTTTTTGATAATATTGCATTTG 

AGGTAGCTCCAGATGTTATTAAAGCTAAAGTATCAGCAGTGGCAGCAAAAATTAAGATCTCTGATCAACAGTTAC 
AGCGTAATGTATCAGAATTATCTGGGGGTCAACAACAAAGGGTAGCATTGGCTCGTGCTCTGGTTCTTGAACCTAA 

60 AATTCTTTGTCTAGATGAACCATTGTCAAACCTTGACGCAAAATTACGTGTAGATTTGAGAAAAGAGTTGAAAA 

CTTCAAAAAGAGTTAGGTATTACTACTTTATATGTTACTCATGATCAAGAGGAAGCCTTGACTTTATCTGATAGAA 
TTGCAGTCTTTAACAATGGATACATCGAACAGGTCGGTACACCAGTAGAGATTTATCATAATTCTCAAACTGAATT 
TGTATGTGATTTTATTGGAGATATTAATGTTTTGACCGATGAAACAGTCCACGAAGTATTATTGAAAAATACAAGC 
GTTTTCTTAGAGGATAAAAAAGGATACATTCGATTAGAGAAAGTTCGATTCAATCGTGAAACTGAACAAGATTTTA 

65 TTCTAAAAGGGACAATTATTGATGTTGAGTTTTCTGGAGTTACAATTCACTATACAATAAAAGTTTCTGAAAGTCA 
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GATTCTTAATGTAACAAGTATTGATAGTCAGGCTGCTATTAGATCTGTCGGAGAAAGTGTGGAATTATTTATCACA 
CCATCAGACGTTCTGCAATTTTAA 

4346.3 

5 ATGCGTCATAAATTAAATTTAAAAGATTGGCTTATTCG 

TCCAAACTTTGATCTAGTAGTGAATGTATTTGTAAAAGGAGGAGAATTTTCCCTTGATGCTGTACATCGTGTTCT 
AAATCTCAGAGGGCACTTCAGAGTATTATGAACAGTTTTAAGTTAGCATTTTCACTCATTATTACA 
TAGGTATTCITTGTGTTCTATTTACAGAGTACTTTGATATTAAAGGTGCTAAAATTTT 
TCITTAATTTATGGAGGAGTGGTTTTAGCGACTGGATATAAATTTGTCTATGGTCCTT 
10 TTTACAAAATGTTATCCCTTCTTTAGACCCTAACTGGTTTAT^ 

CAGGAACTGCTAATCATACATTGTTTTTAACAAATACAATTCGAAGCGTTGACTATCACACTATTGAGGCTGCTCG 

AAATATGGGAGCAAAACCATTTACTGTTTTCCGAAAAGTAGTGTT^^ 

ACTATTATGGTTTTTCTTAGTGGTTTATCTGCAGTAGCAGCACCCATGATTGTTGGTGGTAAAGAATTTCAAACT 
AAATCCAATGATTATTACATTTGCAGGGATGGGGAATTCTCGTGATTTAGCTGCCCTACTTGCAAT^ 

15 ATTGCAACTACAATTTTGCTTACTATCATGAATAAGATAGAAAAAGGTGGAAATTATATTTCTATCT 
AAGCGCCTCTTAAAAAACAAAAAATTGCGTCTAAGCCTTGGAAT^^ 
CACAGTTTTCATGCITCCACTAATTTTTATAGTATTATACTCATTTACAGATCCAGTTG 
TAACATTATCAAACTTTACITTAGAAAATTATCGCTTATTCTTTAGTAATAGTGCGGCATT 
AGCTTTATTTATTCTATTATTGCTGCGACAACAGCAACAATTCTCGCAGTTGTATTTGCT 

20 TAAATCTCGTTTTGATTTCTTATTTGAATATGGTC 
GTTTATTATTTACTTTTAATCAGCCACAATTTCT^ 

ATTGCATATATAGTTGTAAAAATCCCATTTTCTTATAGAATGGTACGTGCTATTTTATTTAGTGTTGATGATG 
GGAAGATGCAGCAAGAAGTATGGGTGCTTCACCTTTTTATACTATGATGAAGGTTATCATTCCATTTATTTTACCG 

GTTGTTCTCTCTGTTATTGCnTTAAACTTTAACTCTTTATTAACTGACTTC 
25 CTAGCTCAACCATTAGGTATTACGATTCGATCTGCAGGTGATGAAACAGCAACATCTAATGCACAAGCTCTGGTAT 
TTGTTTATACAATTGTTCTGATGATTATTTCTGGAACGGTATTATACTTCACACAAAGACCGGGGCGTAAAGTAAG 
GAAATAA 
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MEELVTLDCLFIDRTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYFQEEITPLLIKYAMFDKKQKRGYKESAKNLANW 
HYNDKEDSYTHPDGWYYRFHHTKYQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKAKECQALLSPQGRQIF 
5 AQRKIDVEPVFGQIKASLGYKRCNLRGKRQVRIDMGLVLMANNLLKYSKMKZ 

MGKGHWNRKRVYSIRKFAVGACSVMIGTCAVLLGGNIAGESVVYADETLITHTAEKPKEEKMIVEEKADKALETKNIV 

ERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASNQEKPLKEDAKAVTNEEVNQMIEDRKVDFNQN 

WYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGGEAWYRKTFKLDEKDLKKNVRLTF 

10 DGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLQKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVTDKVHV 
EKNGTTILTPKLEEQQHGKVETHVTSKIVNTDDKDHELVAEYQIVERGGHAVTGLVRTASRTLKAHESTSLDAILEVER 
PKLWTVLNDKPALYELITRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAE 
YRRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFD 
LRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEKIADELDA 

15 VGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGKTATASWTFD 
RDNAGYAGQFIWTGTDYIGEPTPWHNQNQTPVKSSYFGIVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNWENK 
ELASKVADSEGKIPVRAYSNASSVELFLNGKSLGLKTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDES 
GKEIARDKITTAGKPAAVRLIKEDHAIAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRER 
YKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLLKSNQVTVFTGKKEGQEKTVLGTEVPKVQTIIGEAPEMPT 

20 TVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMADGREVEARVEVLALKSELPVVKRIAPNTDLNSVDKSVSYVLIDGS 
VEEYEVDKWEIAEEDKAKLAIPGSRIQATGYLEGQPIHATLVVEEGNPAAPAVPTVTVGGEAVTGLTSQKPMQYRTLA 
YGAKLPEVTASAKNAAVTVLQASAANGMRASIFIQPKDGGPLQTYAIQFLEEAPKIAHLSLQVEKADSLKEDQTVKLSV 
RAHYQDGTQAVLPADKVTFSTSGEGEVAIRKGMLELHKPGAVTLNAEYEGAKDQVELTIQANTEKKIAQSIRPVNVVT 
DLHQEPSLPATVTVEYDKGFPKTHKVTWQAIPKEKLDSYQTFEVLGKVEGIDLEARAKVSVEGIVSVEEVSVTTPIAEAP 

25 QLPESVRTYDSNGHVSSAKVAWDAIRPEQYAKEGVFTVNGRLEGTQLTTKLHVRVSAQTEQGANISDQWTGSELPLAF 
ASDSNPSDPVSNVNDKLISYNNQPANRWTNWNRTNPEASVGVLFGDSGILSKRSVDNLSVGFHEDHGVGVPKSYVIEY 
YVGKTVPTAPKNPSFVGNEDHVFNDSANWKPVTNLKAPAQLKAGEMNHFSFDKVETYAVRIRMVKADNKRGTSITEV 
QIFAKQVAAAKQGQTRIQVDGKDLANFNPDLTDYYLESVDGKVPAVTASVSNNGLATVVPSVREGEPVRVIAKAENGD 
ILGEYRLHFTKDKSLLSHKPVAAVKQARLLQVGQALELPTKVPVYFTGKDGYETKDLTVEWEEVPAENLTKAGQFTVR 

30 GRVLGSNLVAEITVRVTDKLGETLSDNPNYDENSNQAFASATNDIDKNSHDRVDYLNDGDHSENRRWTNWSPTPSSNP 
EVSAGVIFRENGKIVERTVTQGKVQFFADSGTDAPSKLVLERYVGPEFEVPTYYSNYQAYDADHPFNNPENWEAVPYR 
ADKDIAAGDEINVTFKAIKAKAMRWRMERKADKSGVAMIEMTFLAPSELPQESTQSKILVDGKELADFAENRQDYQIT 
YKGQRPKVSVEENNQVASTVVDSGEDSFPVLVRLVSESGKQVKEYRIHLTKEKPVSEKTVAAVQEDLPKIEFVEKDLAY 
KTVEKKDSTLYLGETRVEQEGKVGKERIFTAINPDGSKEEKLREVVEVPTDRIVLVGTKPVAQEAKKPQVSEKADTKPID 

35 SSEASQTNKAQLPSTGSAASQAAVAAGLTLLGLSAGLVVTKGKKEDZ 

MKIMKKKYWTLAILFFCLFNNSVTAQEIPKNLDGNITHTQTSESFSESDEKQVDYSNKNQEEVDQNKFRIQIDKTELFVT 
TDKHLEKNCCKLELEPQINNDIVNSESNNLLGEDNLDNKIKENVSHLDNRGGNIEHDKDNLESSIVRKYEWDrDKVTGG 
GESYKLYSKSNSKVSIAILDSGVDLQNTGLLKNLSNHSKNYVPNKGYLGKEEGEEGIISDIQDRLGHGTAVVAQIVGDDN 
40 INGVNPHVNINVYRIFGKSSASPDWIVKAIFDAVDDGNDIINLSTGQYLMIDGEYEDGTNDFETFLKYKKAIDYANQKGV 
IIVAALGNDSLNVSNQSDLLKLISSRKKVRKPGLVVDVPSYFSSTISVGGIDRLGNLSDFSNKGDSDAIYAPAGSTLSLSEL 
GLNNFINAEKYKEDWIFSATLGGYTYLYGNSFAAPKVSGAIAMIIDKYKLKDQPYNYMFVKKFWKKHYQZ 

MKKTWKVFLTLVTALVAVVLVACGQGTASKDNKEAELKKVDFILDWTPNTNHTGLYVAKEKGYFKEAGVDVDLKLP 
45 PEESSSDLVINGKAPFAVYFQDYMAKKLEKGAGITAVAArVEHNTSGIISRKSDNVSSPKDLVGKKYGTWNDPTELAML 
KTLVESQGGDFEKVEKVPNNDSNSITPIANGVFDTAWIYYGWDGILAKSQGVDANFMYLKDYVKEFDYYSPVIIANND 
YLKDNKEEARKVIQAIKKGYQYAMEHPEEAADILIKNAPELKEKRDFVIESQKYLSKEYASDKEKWGQFDAARWNAFY 
KWDKENGILKEDLTDKGFTNEFVKZ 

50 MKRTWRNSFVTNLNTPFMIGNIEIPNRTVLAPMAGVTNSAFRTIAKELGAGLVVMEMVSDKGIQYNNEKTLHMLHIDE 
GENPVSIQLFGSDEDSLARAAEFIQENTKTDIVDINMGCPVNKIVKNEAGAMWLKDPDKIYSIINKVQSVLDIPLTVKMR 
TGWADPSLAVENALAAEAAGVSALAMHGRTREQMYTGHADLETLYKVAQALTKIPFIANGDIRTVQEAKQRIEEVGA 
DAVMIGRAAMGNPYLFNQINHYFETGEILPDLTFEDKMKIAYEHLKRLINLKGENVAVREFRGLAPHYLRGTSGAAKL 

^ RGAISQASTLAEIETLLQLEKAZ 

MIKNPKLLTKSFLRSFAILGGVGLVIHIAIYLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEIPSLLQSYSKSLTISAHLK 
RDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTADGKQVTVQFVHGVDVYKEAKNILLLYLPYTFLVTIAFSFVFSYF 
YTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQNQKVSFVRGAS 
HELKTPLASLRIILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETTVKPVLVDILSRYQELAH 
60 SIGVTIENQLTDATRVVMSLRALDKVLTNLISNAIKYSDKNGRVIISEQDGYLSIKNTCAPLSDQELEHLFDIFYHSQIVTD 
KDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ 

MYLGDLMEKAECGQFSILSFLLQESQTTVKAVMEETGFSKATLTKYVTLLNDKALDSGLELAIHSEDENLRLSIGAATK 
GRDIRSLFLESAVKYQILVYLLYHQQFLAHQLAQELVISEATLGRHLAGLNQILSEFDLSIQNGRWRGPEHQIHYFYFCL 
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FRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLDLVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLR 
LLRKVPSFFAGQHIPLGVEDGEMMIFFSFLLSHRILPLHTMEYILGFGGQLADLLTQLIQEMKKEELLGDYTEDHVTYEL 
SQLCAQVYLYKGYILQDRYKYQLENRHPYLLMEHDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLIEYMAENGGQ 
HMRIGLDLTSGFLVFSRMAAILKRYLEYNRFITIEAYDPSRHYDLLVTNNPIHKKEQTPVYYLKNDLDMEDLVAIRQLLF 

5 TZ 

MEFSKKTRELSIKKMQERTLDLLIIGGGITGAGVALQAAASGLETGLIEMQDFAEGTSSRSTKLVHGGLRYLKQFDVEV 
VSDTVSERAVVQQIAPHIPKSDPMLLPVYDEDGATFSLFRLKVAMDLYDLLAGVSNTPAANKVLSKDQVLERQPNLKK 
EGLVGGGVYLDFRNNDARLVIENIKRANQDGALIANHVKAEGFLFDESGKITGVVARDLLTDQVFEIKARLVINTTGPW 
10 SDKVRNLSNKGTQFSQMRPTKGVHLVVDSSKIKVSQPVYFDTGLGDGRMVFVLPRENKTYFGTTDTDYTGDLEHPKVT 
QEDVDYLLGIVNNRFPESNITIDDIESSWAGLRPLIAGNSASDYNGGNNGTISDESFDNLIATVESYLSKEKTREDVESAV 
SKLESSTSEKHLDPSAVSRGSSLDRDDNGLLTLAGGKITDYRKMAEGAMERVVDILKAEFDRSFKLINSKTYPVSGGELN 
PANVDSEIEAFAQLGVSRGLDSKEAHYLANLYGSNAPKVFALAHSLEQAPGLSLADTLSLHYAMRNELTLSPVDFLLRR 
TNHMLFMRDSLDSIVEPILDEMGRFYDWTEEEKATYRADVEAALANNDLAELKNZ 

15 

MMNELFGEFLGTLILILLGNGVVAGVVLPKTKSNSSGWIVITMGWGIAVAVAVFVSGKLSPAYLNPAVTIGVALKGGLP 

WASVLPYILAQFAGAMLGQILVWLQFKPHYEAEENAGNILATFSTGPAIKDTVSNLISEILGTFVLVLTIFALGLYDFQA 

GIGTFAVGTLIVGIGLSLGGTTGYALNPARDLGPRIMHSILPIPNKGDGDWSYAWIPVVGPVIGAALAVLVFSLFZ 

20 MTKKKIERISVIHREKILWLKWYFMRDKEQPKYSVLERKMFDAAKNQDMLAYQKYATIKQITDIRVQTSEADILEAVKE 
VYVYNHMNVIGACQRILFISQSPAYDKLNKWFNIYSDLYFSVVPLPKMGVYHEMVGIZ 

MKNSNEAEMKLLYTDIRTSLTEILTREAEELVAAGKRVFYIAPNSLSFEKERAVLEYLSQQASFSITVTRFAQMARYLVL 
NDLPAKTTLDDIGLGLAFYKCLAELDPKDLRVYGAIKQDPQLIQQUELYHEMTKSQMSFLDLENLTDEDKRADLLLIF 

25 EKVTAYLNQGQLAQESQLSHLIEAIENDKVSSDFNQIALVIDGFTRFSAEEERVVDLLHGKGVEIVIGAYASKKAYTSPFS 
EGNLYQASVKFLHHLASKYQTPAQDCSQTHEKMDSFDKASRLLESSYDFSELALDVDEKDRENLQIWSCLTQKEELEL 
VARSIRQKLHENSDLSYKHFRILLGDVASYQLSLKTIFDQYQIPFYLGRSEAMAHHPLTQFVESILALKRYRFRQEDLINL 
LRTDLYTDLSQSDIDAFEQYIRYLGINGLPAFQQTFTKSHHGKFNLERLNVLRLRILAPLETLFASRKQKAEKLLQKWSV 
FLKEGAVTKQLQDLTTTLEAVEQERQAEVWKAFCHVLEQFATVFAGSQVSLEDFLALLHSGMSLSQYRTIPATVDTVL 

30 VQSYDLIAPLTADFVYAIGLTQDNLPKISQNTSLLTDEERQNLNQATEEGVQLLIASSENLKKNRYTMLSLVNSARKQLF 
LSAPSLFNESESKESAYLQELIHFGFRRREKRMNHKGLSKEDMGSYHSLLSSLVAYHQQGEMSDTEQDLTFVKVLSRVI 
GKKLDQQGLENPAIPTSPSSKTLAKDTLQALYPAKQEFYLSTSGLTEFYRNEYSYFLRYVLGLQEELRLHPDARSHGNFL 
HRIFERALQLPNEDSFDQRLEQAIQETSQEREFEAIYQESLEAQFTKEVLLDVARTTGHILRHNPAIETIKEEANFGGKDQ 
AFIQLDNGRSVFVRGKVDRIDRLKANGAIGVVDYKSSLTQFQFPHFFNGLNSQLPTYLAALKREGEQNFFGAMYLEMA 

35 EPVQSLMAVKSLAGAVVEASKSMKYQGLFLEKESSYLGEFYNKNKANQLTDEEFQLLLDYNAYLYKKAAEKILAGRF 
AINPYTENGRSIAPYVQQHQAITGFEANYHLGQARFLEKLDLADGKRLVGEKLKQAWLEKIREELNRZ 



MKLIPFLSEEEIQKLQEAEANSSKEQKKTAEQIEAIYTSAQNILVSASAGSGKTFVMAERILDQLARGVEISQLFISTFTVK 
40 AATELKERLEKKISKKIQETDDVDLKQHLGRQLADLPNAAIGTMDSFTQKFLGKHGYLLDIAPNFR1LQNQSEQLILENE 
VFHEVFEAHYQGKQKETFSHLLKNFAGRGKDERGLRQQVYKIYDFLQSTSNPQKWLSESFLKGFEKADFTSEKEKLTE 
QIKQALWDLESFFRYHLDNDAKEFAKAAYLENVQLILDEIGSLNQESDSQAYQAVLARVVAISKEKNGRALTNASRKA 
DLKPLADAYNEERKTQFAKLGQLSDQIAILDYQERYHGDTWKLAKTFQSFMSDFVEAYRQRKRQENAFEFADISHYTIE 
ILENFPQVRESYQERFHEVMVDEYQDTNHIQERMLELLSNGHNRFMVGDIKQSIYRFRQADPQIFNEKFQRYAQNPQEG 
45 RLIILKENFRSSSEVLSATNDVFERLMDQEVGEINYDNKHQLVFANTKLTPNPDNKAAFLLYDKDDTGEEEESQTETKL 
TGEMRLVIKEILKLHQEKGVAFKEIALLTSSRSRNDQILLALSEYGIPVKTDGEQNNYLQSLEVQVMLDTLRVIHNPLQD 
YALVALMKSPMFGFDEDELARLSLQKAEDKVHENLYEKLVNAQKMASSQKGLIHTALAEKLKQFMDILASWRLYAKT 
HSLYDLIWKIYNDRFYYDYVGALPNGPARQANLYALALRADQFEKSNFKGLSRFIRMIDQVLEAQHDLASVAVAPPKD 
AVELMTIHKSKGLEFPYVFILNMDQDFNKQDSMSEVILSRQNGLGVKYIAKMETGAVEDHYPKTIKLSIPSLTYRQNEEE 
50 LQLASYSEQMRLLYVAMTRAEKKLYLVGKGSREKLESKEYPAAKNGKLNSNTRLQARNFQDWLWAISKVFTKDKLNF 
SYRFIGEDQLTREAIGELETKSPLQDSSQADNRQSDTIKEALEMLKEVEVYNTLHRAAIELPSVQTPSQIKKFYEPVMDM 
EGVEIAGQGQSVGBCKISFDLPDFSTKEKVTGAEIGSATHELMQRIDLSQQLTLASLTETLKQVQTSQAVRDKINLDKILAF 
FDTVLGQEILANTDHLYREQPFSMLKRDQKSQEDFVVRGILDGYLLYENKIVLFDYKTDRYDEPSQLVDRYRGQLALY 
EEALSRAYSIENIEKYLILLGKDEVQVVKVZ 

55 

MELARHAESLGVDAIATIPPIYFRLPEYSVAKYWNDISSAAPNTDYVIYNIPQLAGVALTPSLYTEMLKNPRVIGVKNSS 
MPVQDIQTFVSLGGEDHIVFNGPDEQFLGGRLMGARAGIGGTYGAMPELFLKLNQLIADKDLETARELQYAINAIIGKL 
TSAHGNMYGVIKEVLKINEGLNIGSVRSPLTPVTEEDRPVVEAAAALIRETKERFLZ 

60 

MYKTKCLREKLVLFLKIFFPILIYQFANYSASFVDTAMTGQYNTMDLAGVSMATSIWNPFFTFLTGIVSALVPIIGHHLG 
RGKKEEVASDFYQFIYLALGLSVVLLGMVLFLAPIILNHIGLEAAVAAVAVRYLWFLSIGIIPLLLFSVIRSLLDSLGLTKL 
SMYLMLLLLPLNSGFNYLLIYGAFGVPELGGAGAGLGTSLAYWVLLGISVLVLFKQEKLKALHLEKRIPLNMDKIKEGV 
RLGLPIGGTVFAEVAIFSVVGLIMAKFSPLIIASHQSAMNFSSLMYAFPMSISSAMAIVVSYEVGAKRFDDAKTYIGLGRW 
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TALIFAAFTLTFLYIFRGNVASLYGNDPKFIDLTVRFLTYSLFFQLADTFAAPLQGILRGYKDTVIPFYLGLLGYWGVAIP 
VYAIZ 

MSTLAKIEALLFVAGEDGIRVRQLAELLSLPPTGIQQSLGKLAQKYEKDPDSSLALIETSGAYRLVTKPQFAEILKEYSKA 
5 PINQSLSRAALETLSIIAYKQPITRIEIDAIRGVNSSGALAKLQAFDLIKEDGKKEVLGRPNLYVTTDYFLDYMGINHLEEL 
PVIDELEIQAQESQLFGERIEEDENQZ 

MDTMISRFFRHLFEALKSLKRNGWMTVAAVSSVMITLTLVAIFASVIFNTAKLATDIENNVRVVVYIRKDVEDNSQTIE 
KEGQTVTNNDYHKVYDSLKNMSTVKSVTFSSKEEQYEKLTEIMGDNWKIFEGDANPLYDAYIVEANTPNDVKTLAEDA 
10 KKIEGVSEVQDGGANTERLFKLASFIRVWGLGIAALLIFIAVFLISNTIRITIISRSREIQIMRLVGAKNSYIRGPFLLEGAFIG 
LLGAIAPSVLVFIVYQIVYQSVNKSLVGQNLSMISPDLFSPLMIALLFVIGVFIGSLGSGISMRRFLKIZ 

MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIKADANYAENEWLKQGDDYF 
YLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEW 

15 LQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWIFENGHYYYLKSGGY 
MAANEWIWDKESWFYLKFDGKMAEKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYD 
SHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKIAEKEWVYDSHSQAWYYFKSGGYMAKNETVDGYQLGSDGK 
WLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLAITISGLSGYMKTEDLQALDASKD 
FIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNI 

20 NNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFDDVDKGILGATKWI 
KENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

MKKVLQKYWAWAFVVIPLLLQAIFFYVPMFQGAFYSFTNWTGLTYNYKFVGLNNFKLLFMDPKFMNAIGFTAIIAIAM 
VVGEIALGIFIARVLNSKIKGQTFFRAWFFFPAVLSGLTVALIFKQVFNYGLPAIGNALHIEFFQTSLLGTKWGAIFAAVF 
25 VLLWQGVAMPIIIFLAGLQSIPTEITEAARIDGATSKQVFWNIELPYLLPSVSMVFILALKGGLTAFDQVFAMTGGGPNN 
ATTSLGLLVYNYAFKNNQFGYANAIAVILFFLIVVISIIQLRVSKKFEIZ 

MMKQDERKALIGKYILLILGSVLILVPLLATLFSSFKPTKDIVDNFFGFPTNFTWDNFSRLLADGIGGYYWNSVVITVLSL 
LAVMIFIPMAAYSIARNMSKRKAFTIMYTLLILGIFVPFQVIM^ 
30 SIPESLDEAAEIDGANQFTTYFRIIFPMMKPMHATTMIINALWFWNDFMLPLLVLNRDSKMWTLPLFQYNYAGQYFND 
YGPSFASYVVGIISITIVYLFFQRHIISGMSNGAVKZ 

MKSILQKMGEHPMLLLFLSYSTVISILAQNWMGLVASVGMFLFTIFFLHYQSILSHKFFRLILQFVLFGSVLSAAFASLEH 
FQIVKKFNYAFLSPNMQVWHQNRAEVTFFNPNYYGIICCFCIMIAFYLFTTTKLNWLKVFCVIAGFVNLFGLNFTQNRT 
35 AFPAIIAGAIIYLFTTIKNWKAFWLSIGVFAIGLSFLFSSDLGVRMGTLDSSMEERISIWDAGMALFKQNPFWGEGPLTYM 
NSYPRIHAPYHEHAHSLYIDTILSYGIVGTILLVLSSVAPVRLMMDMSQESGKRPIIGLYLSFLTVVAVHGIFDLALFWIQS 
GFIFLLVMCSIPLEHRMLVSDMTDZ 

MSKMDVQKIIAPMMKFVNMRGIIALKDGMLAILPLTVVGSLFLIMGQLPFEGLNKSIASVFGANWTEPFMQVYSGTFAI 
40 MGLISCFSIAYSYAKNSGVEALPAGVLSVSAFFILLRSSYIPKQGEAIGDAISKVWFGGQGIIGAIIIGLVVGSIYTFFIKRKIV 
IKMPEQVPQAIAKQFEAMIPAFVIFLSSMIVYILAKSLTNGGTFIEMIYSAIQVPLQGLTGSLYGAIGIAFFISFLWWFGVH 
GQSVVNGVVTALLLSNLDANKAMLASANLSLENGAHIVTQQFLDSFLILSGSGITFGLVVAMLFAAKSKQYQALGKVA 
AFPAIFNVNEPVVFGFPIVMNPVMFVPFILVPVLAAVIVYGAIATGFMQPFSGVTLPWSTPAILSGFLVGGWQGVITQLVI 
LAMSTLVYFPFFKVQDRLAYQNEIKQSZ 

45 

MKKKDLVDQLVSEIETGKVRTLGIYGHGASGKSTFAQELYQALDSTTVNLLETDPYITSGRHLVVPICDAPNQKVTASLP 
VAHELESLQRDILACRRVWMSZ 

MKKRYLVLTALLALSLAACSQEKTKNEDGETKTEQTAKADGTVGSKSQGAAQKKAEVVNKGDYYSIQGKYDEIIVAN 
50 KHYPLSKDYNPGENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKLYQDYVNQDGKAAADRYSARPGYSEHQT 
GLAFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLEEYYGF 
EGGDYVDZ 

MREPDFLNHFLKKGYFKKHAKAVLALSGGLDSMFLFKVLSTYQKELEIELILAHVNHKQRIESDWEEKELRKLAAEAE 
55 LPIYISNFSGEFSEARARNFRYDFFQEVMKKTGATALVTAHHADDQVETIFMRLIRGTRLRYLSGIKEKQVVGEIEIIRPFL 
HFQKKDFPSIFHFEDTSNQENHYFRNRIRNSYLPELEKENPRFRDAILGIGNEILDYDLAIAELSNNINVEDLQQLFSYSES 
TQRVLLQTYLNRFPDLNLTKAQFAEVQQILKSKSQYRHPIKNGYELIKEYQQFQICKISPQADEKEDELVLHYQNQVAY 
QGYLFSFGLPLEGELIQQIPVSRETSIHIRHRKTGDVLIKNGHRKKLRRLFIDLKIPMEKRNSALIIEQFGEIVSILGIATNNL 
SKKTKNDIMNTVLYIEKIDRZ 

60 

MRKFLIILLLPSFLTISKVVSTEKEVVYTSKEIYYLSQSDFGIYFREKLSSPMVYGEVPVYANEDLVVESGKLTPICTSFQIT 
EWRLNKQGIPVFKLSNHQFIAADKRFLYDQSEVTPTIKKVWLESDFKLYNSPYDLKEVKSSLSAYSQVSIDKTMFVEGRE 
FLHIDQAGWVAKESTSEEDNRMSKVQEMLSEKYQKDSFSrYVKQLTTGKEAGINQDEKMYAASVLKLSYLYYTQEKIN 
EGLYQLDTTVKYVSAVNDFPGSYKPEGSGSLPKKEDNKEYSLKDLITKVSKESDNVAHNLLGYYISNQSDATFKSKMSA 
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IMGDDWDPKEKLISSKMAGKFMEAIYNQNGFVLESLTKTDFDSQRIAKGVSVKVAHKIGDADEFKHDTGVVYADSPFIL 
SIFTKNSDYDTISKIAKDVYEVLKZ 

MKKQNNGLIKNPFLWLLFIFFLVTGFQYFYSGNNSGGSQQINYTELVQEITDGNVKELTYQPNGSVIEVSGVYKNPKTSK 
5 EETGIQFFTPSVTKVEKFTSTILPADTTVSELQKLATDHKAEVTVKHESSSGIWINLLVSIVPFGILFFFLFSMMGNMGGG 
NGRNPMSFGRSKAKAANKEDIKVRFSDVAGAEEEKQELVEVVEFLKDPKRFTKLGAR1PAGVLLEGPPGTGKTLLAKA 
VAGEAGVPFFSISGSDFVEMFVGVGASRVRSLFEDAKKAAPAIIFIDEIDAVGRQRGVGLGGGNDEREQTLNQLLIEMDG 
FEGNEGIIVIAATNRSDVLDPALLRPGRFDRKVLVGRPDVKGREAILKVHAKNKPLAEDVDLKLVAQQTPGFVGADLEN 
VLNEAALVAARRNKSIIDASDIDEAEDRVIAGPSKKDKTVSQKERELVAYHEAGHTIVGLVLSNARVVHKVTIVPRGRA 
10 GGYMIALPKEDQMLLSKEDMKEQLAGLMGGRVAEEIIFNVQTTGASNDFEQATQMARAMVTEYGMSEKLGPVQYEG 
NHAMLGAQSPQKSISEQTAYEIDEEVRSLLNEARNKAAEIIQSNRETHKLIAEALLKYETLDSTQIKALYETGKMPEAVE 
EES HALS YDEVKSKMNDEKZ 

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENP 
15 SQDKVEGIRDLFLTILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 
VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG 
YTPGHKSYVSQEKIAGTDWTVLGVSSLEKLDQVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLEIAS 
GAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQR 
VVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 
20 KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV 
EIYINRIETSZ 

MKRSSLLVRMVISIFLVFLILLALVGTFYYQSSSSAIEATIEGNSQTTISQTSHFIQSYIKKLETTSTGLTQQTDVLAYAENP 
SQDKVEGIRDLFLTILKSDKDLKTVVLVTKSGQVISTDDSVQMKTSSDMMAEDWYQKAIHQGAMPVLTPARKSDSQW 

25 VISVTQELVDAKGANLGVLRLDISYETLEAYLNQLQLGQQGFAFIINENHEFVYHPQHTVYSSSSKMEAMKPYIDTGQG 
YTPGHKSYVSQEK1AGTDWTVLGVSSLEKLDQVRSQLLWTLLGASVTSLLVCLCLVWFSLKRWIAPLKDLRETMLEIAS 
GAQNLRAKEVGAYELREVTRQFNAMLDQIDQLMVAIRSQEETTRQYQLQALSSQINPHFLYNTLDTIIWMAEFHDSQR 
VVQVTKSLATYFRLALNQGKDLICLSDEINHVRQYLFIQKQRYGDKLEYEINENVAFDNLVLPKLVLQPLVENALYHGI 
KEKEGQGHIKLSVQKQDSGLVIRIEDDGVGFQDAGDSSQSQLKRGGVGLQNVDQRLKLHFGANYHMKIDSRPQKGTKV 

30 EIYINRIETSZ 

MFFKLLREALKVKQVRSKILFTIFIVLVFRIGTSITVPGVNANSLNALSGLSFLNMLSLVSGNALKNFSIFALGVSPYITASI 
VVQLLQMDILPKFVEWGKQGEVGRRKLNQATRYIALVLAFVQSIGITAGFNTLAGAQLIKTALTPQVFLTIGIILTAGSMI 
VTWLGEQITDKGYGNGVSMIIFAGIVSSIPEMIQGIYVDYFVNVPSSRITSSIIFVIILIITVLLIIYFTTYVQQAEYKIPIQYTK 
35 VAQGAPSSSYLPLKVNPAGVIPVIFASSITAAPAAILQFLSATGHDWAWVRVAQEMLATTSPTGIAMYALLIILFTFFYTF 
VQINPEKAAETYKRVVPISMEFVLVKVQKNICLNFFVVLQLLVPSSLVZ 

MDIRQVTETIAMIEEQNFDIRTITMGISLLDCIDPDINRAAEKIYQKITTKAANLVAVGDEIAAELGIPIVNKRVSVTPISLIG 
AATDATDYVVLAKALDKAAKEIGVDFIGGFSALVQKGYQKGDEILINSIPRALAETDKVCSSVNIGSTKSGINMTAVAD 
40 MGRIIKETANLSDMGVAKLVVFANAVEDNPFMAGAFHGVGEADVIINVGVSGPGVVKRALEKVRGQSFDVVAETVKK 
TAFKITRIGQLVGQMASERLGVEFGIVDLSLAPTPAVGDSVARVLEEMGLETVGTHGTTAALALLNDQVKKGGVMAC 
NQVGGLSGAFIPVSEDEGMIAAVQNGSLNLEKLEAMTAICSVGLDMIAIPEDTPAETIAAMIADEAAIGVINMKTTAVRII 
PKGKEGDMIEFGGLLGTAPVMKVNGASSVDFISRGGQIPAPIHSFKNZ 

45 MTQIIDGKj\LAAKLQGQLAEKTAKLKEETGLVPGLVVILVGDNPASQVYVRNKERSALAAGFRSEVVRVPETITQEELL 
DLIAKYNQDPAWHGILVQLPLPKHIDEEAVLLAIDPEKDVDGFHPLNMGRLWSGHPVMIPSTPAGIMEMFHEYGIDLEG 
KNAVVIGRSNIVGKPMAQLLLAKNATVTLTHSRTHNLSKVAAKADILVVAIGRAKFVTADFVKPGAVVIDVGMNRDEN 
GKLCGDVDYEAVAPLASHITPVPGGVGPMTITMLMEQTYQAALRTLDRKZ 

50 MSKFNRIHLVVLDSVGIGAAPDANNFVNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNIPRETPLKTVAAESNPTGY 
ATKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEILTKIEEFSGRKVIREANKPYSGTAVIYDFGPRQMETGELII 
YTSADPVLQIAAHEDIIPLDELYRICEYARSITLERPALLGRIIARPYVGEPGNFTRTANRRDLAVSPFFPTVLDKLNEAGI 
DTYAVGKINDIFNGAGINHDMGHNKSNSHGIDTLLKTMGLAEFEKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDE 
RLPEIIAAMRENDLLLITADHGNDPTYAGTDHTREYIPLLAYSPAFKGNGLIPVGHFADISATVADNFGVETAMIGESFL 

55 DKLVZ 

MFISISAGIVTFLLTLVEIPAFIQFYRKAQITGQQMHEDVKQHQAKAGTPTMGGLVFLITSVLVAFFFALFSSQFSNNVGM 
ILFILVLYGLVGFLDDFLKVFRKINEGLNPKQKLALQLLGGVIFYLFYERGGDILSVFGYPVHLGFFYIFFALFWLVGFSN 
AVNLTDGVDGLASISVVISLSAYGVIAYVQGQMDILLVILAMIGGLLGFFIFNHKPAKVFMGDVGSLALGGMLAAISMA 
60 LHQEWTLLIIGIVYVFETTSVMMQVSYFKLTGGKRIFRMTPVHHHFELGGLSGKGNPWSEWKVDFFFWGVGLLASLLT 
LAILYLMZ 

LFKKNKDILNIALPAMGENFLQMLMGMVDSYLVAHLGLIAISGVSVAGNIITIYQAIFIALGAAISSVISKSIGQKDQSKLA 
YHVTEALKITLLLSFLLGFLSIFAGKEMIGLLGTERDVAESGGLYLSLVGGSIVLLGLMTSLGALIRATHNPRLPLYVSFL 
65 SNALNILFSSLAIFVLDMGIAGVAWGTIVSRLVGLVILWSQLKLPYGKPTFGLDKELLTLALPAAGERLMMRAGDVVIIA 
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LVVSFGTEAVAGNAIGEVLTQFNYMPAFGVATATVMLLARAVGEDDWKRVASLSKQTFWLSLFLMLPLSFSIYVLGVP 
LTHLYTTDSLAVEASVLVTLFSLLGTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIR1GTGYLMGIVLGWGLPGIW 
AGSLLDNGFRWLFLRYRYQRYMSLKGZ 

5 MQTQEKHSQAAVLGLQHLLAMYSGSILVPIMIATALGYSAEQLTYLISTDIFMCGVATFLQLQLNKYFGIGLPVVLGVA 
FQSVAPLIMIGQSHGSGAMFGALIASGIYVVLVSGIFSKVANLFPSIVTGSVITTIGLTLIPVAIGNMGNNVPEPTGQSLLLA 
AITVLIILLINIFTKGFIKSISILIGLVVGTAIAATMGLVDFSPVAVAPLVHVPTPLYFGMPTFEISSIVMMCIIATVSMVEST 
GVYLALSDITKDPIDSTRLRNGYRAEGLAVLLGGIFNTFPYTGFSQNVGLVKLSGIKKRLPIYYAAGFLVLLGLLPKFGA 
LAQIIPSSVLGGAMLVMFGFVSIQGMQILARVDFANNEHNFLIAAVSIAAGVGLNNSNLFVSMPTAFQMFFSNGIVVASL 
10 LAIVLNAVLNHKKKZ 

MKDRIKEYLQDKGKVTVNDLAQALGKDSSKDFRELIKTLSLMERKHQIRFEEDGSLTLEIKKKHEITLKGIFHAHKNGFG 
FVSLEGEEDDLFVGKNDVNYAIDGDTVEVVIKKVADRNKGTAAEAKIIDILEHSLTTVVGQIVLDQEKPKYAGYIRSKN 

15 QKISQPIYVKKPALKLEGTEVLKVFIDKYPSKKHDFFVASVLDVVGHSTDVGIDVLEVLESMDIVSEFPEAVVKEAESVP 
DAPSQKDMEGRLDLRDEITFTIDGADAKDLDDAVHIKALKNGNLEFGVHIADVSYYVTEGSALDKEALNRATSVYVTD 
RVVPMLPERLSNGICSLNPQVDRLTQSAIMEIDKHGRVVNYTITQTVIKTSFRMTYSDVNDILAGDEEKRKEYHKIVSSIE 
LMAKLHETLENMRVKRGALNFDTNEAKILVDKQGKPVDIVLRQRGIAERMIESFMLMANETVAEHFSKLDLPFIYRIHE 
EPKAEKVQKFIDYASSFGLRIYGTASEISQEALQDIMRAVEGEPYADVLSMMLLRSMQQARYSEHNHGHYGLAADYYT 

20 HFTSPIRRYPDLLVHRMIRDYGRSKEIAEHFEQVIPEIATQSSNRERRAIEAEREVEAMKKAEYMEEYVGEEYDAVVSSIV 
KFGLFVELPNTVEGLIHITNLPEFYHFNERDLTLRGEKSGITFRVGQQIRIRVERADKMTGEIDFSFVPSEFDVIEKGLKQS 
SRSGRGRDSNRRSDKKEDKRKSGRSNDKRKHSQKDKKKKGKKPFYKEVAKKGAKHGKGRGKGRRTKZ 

MGTTGFTIIDLIILIVYLLAVLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSIFATMLSPISFLGLAGSSYAGSWILWFA 
25 QLGMVVAIPLTIRFILPIFARIDIDTAYDYLDKRFNSKALRJISALLFIIYQLGRMSIIMYLPSAGLSVLTGIDINILIILMGVV 
AIVYSYTGGLKSVLWTDFIQGVIUSGVVLALFVLIANIKGGFGAVAETLANGKFLAANEKLFDPNLLSNSIFLIVMGSGF 
TILSSYASSQDLVQRFTTTQNIKKLNKMLFTNGVLSLATATVFYLIGTGLYVFYQVQNADSAASNIPQDQIFMYFIAYQL 
PVGITGLILAAIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLAVGLFSIGVSIVMAHSDIKSAYEWF 
NSFMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVSYWAYSLISISVSVVSGYIVSVLTGNKVS 
30 APKYTTIHDITEIKADSSWEVRHZ 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 
TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADNVRTKDEINRQK 
QEHVKDNEKVNSNVAVARSQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNM 
35 QPSQLSYSSTASDNNTQSVAKGSTSKPANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHF 
IPYSKLSALEEKIARMVPISGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGD 
HFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIK 
VRKNIZ 

40 MKKRAIVAVIVLLLIGLDQLVKSYIVQQIPLGEVRSWIPNFVSLTYLQNRGAAFSILQDQQLLFAVITLVVVIGAIWYLHK 
HMEDSFWMVLGLTLIIAGGLGNFIDRVSQGFVVDMFHLDFINFAIFNVADSYLTVGVIILLIAMLKEEINGNZ 

MNTNLASFIVGLIIDENDRFYFVQKDGQTYALAKEEGQHTVGDTVKGFAYTDMKQKLRLTTLEVTATQDQFGWGRVT 
EVRKDLGVFVDTGLPDKEIVVSLDILPELKELWPKKGDQLYIRLEVDKKDR1WGLLAYQEDFQRLARPAYNNMQNQN 
45 WPAIVYRLKLSGTFVYLPENNMLGFIHPSERYAEPRLGQVLDARVIGFREVDRTLNLSLKPRSFEMLENDAQMILTYLE 
SNGGFMTLNDKSSPDDIKATFGISKGQFKKALGGLMKAGKIKQDQFGTELIZ 

MKDVSLFLLKKVFKSRLNWIVLALFVSVLGVTFYLNSQTANSHSLESRLESRIAANERAINENEEKLSQMSDTSSEEYQF 
AKNNLDVQKNLLTRKTEILTLLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALYPLNIKAH 
50 TLEFPTHGIDQIVWILEVnPSLFVVAIIFMLTQLFAERYQNHLDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLV 
GSLISGFGQLDYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAFIVIVEVVYLIAYFFKQKMPVLFLSLIGIVGLLFGIQTIQP 
LQR1AHLIPFTYLRSVEILSGRLPKQIDNVDLNWSMGMVLLPCLIIFLLLGILFIERWGSSQKKEFFNRFZ 

MMKFILDIVSTPAILVALIAILGLVLQKKKLPDIIKGGIKTFVGFLVVSGGAGIVQNSLNPFGTMFEHAFHLSGVVPNNEAI 
55 VAVALTTYGSATAMIMFAGMVFNILIARFTRFKYIFLTGHHTLYMACMIAVILSVAGFTSLPLILLGGLALGIIMS1SPAF 
VQKYMVQLTGNDKVALGHFSSLGYWLSGFTGSLIGDKSKSTEDIKFPKSLAFLRDSTVSITLSMAVIYIIVAIFAGSEYIEK 
EISSGTSGLVYALQLAGQFAAGVFVILAGVRLILGEIVPAFKGISERLVPNSKPALDCPIVYTYAPNAVLIGFISSFVGGLVS 
MVIMIASGTVVILPGVVPHFFCGATAGVIGNASGGVRGATIGAFLQGILISFLPVFLMPVLGGLGFQGSTFSDADFGLSGII 
^ LGMLNQFGSQAGIVIGLVLILAVMFGVSFIKKPSATEEZ 

MIKTFLSALSVILFSIPIITYSFFPSSNLNIWLSTQPILAQIYAFPLATATMAAILSFLFFFLSFYKKNKQIRFYSGILLLLSLIL 
LLFGTDKTLSSASNKTKTLKLVTWNVANQIEAQHIERIFSHFDADMAIFPELATNIRGEQENQRIKLLFHQVGLSMANYD 
IFTSPPTNSGIAPVTVIVKKSYGFYTEAKTFHTTRFGTIVLHSRKQNIPDIIALHTAPPLPGLMEIWKQDLNIIHNQLASKYP 
KAIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPKLFNATIDHILLPKNHYYVKDLDIVSFQNSDHRCIFT 
65 EITFZ 
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MNPIQRSWAYVSRKRLRSFILFLILLVLLAGISACLTLMKSNKTVESNLYKSLNTSFSIKKIENGQTFKLSDLASVSKIKGL 
ENVSPELETVAKLKDKEAVTGEQSVERDDLSAADNNLVSLTALEDSSKDVTFTSSAFNLKEGRHLQKGDSKKILIHEEL 
AKKNGLSLHDKIGLDAGQSESGKGQTVEFEIIGIFSGKKQEKFTGLSSDFSENQVFTDYESSQTLLGNSEAQVSAARFYVE 
5 NPKEMDGLMKQVENLALENQGYQVEKENKAFEQIKDSVATFQTFLTIFLYGMLIAGAGALILVLSLWLRERVYEVGIL 
LALGKGKSSIFLQFCLEVVLVSLGALLPAFVAGNAITTYLLQTLLASGDQASLQDTLAKASSLSTSILSFAESYVFLVLLS 
CLSVALCFLFLFRKSPKEILSSISZ 

MLHNAFAYVTRKFFKSIVIFLIILLMASLSLVGLSIKGATAKASQETFKNITNSFSMQINRRVNQGTPRGAGNIKGEDIKKI 
10 TENKAIESYVKRINAIGDLTGYDLIETPETKKNLTADRAKRFGSSLMITGVNDSSKEDKFVSGSYKLVEGEHLTNDDKDK 
ILLHKDLAAKHGWKVGDKVKLDSNIYDADNEKGAKETVEVTIKGLFDGHNKSAVTYSQELYENTAITDIHTAAKLYGY 
TEDTAIYGDATFFVTADKNLDDVMKELNGISGINWKSYTLVKSSSNYPALEQSISGMYKMANLLFWGSLSFSVLLLALL 
LSLWINARRKEVGILLSIGLKQASILGQFITESILIAIPALVSAYFLANYTARAIGNTVLANVTSGVAKQASKAAQASNLGG 
GAEVDGFSKTLSSLDISIQTSDFIIIFVLALVLVVLVMALASSNLLRKQPKELLLDGEZ 

MSQDKQMKAVSPLLQRVINISSIVGGVGSLIFCIWAYQAGILQSKETLSAFIQQAGIWGPPLFIFLQILQTVVPIIPGALTSV 

AGVFIYGHIIGTIYNYIGIVIGCAIIFYLVRLYGAAFVQSVVSKRTYDKYIDWLDKGNRFDRFFIFMMIWPISPADFLCMLA 

ALTKMSFKRYMTIIILTKPFTLVVYTYGLTYIIDFFWQMLZ 

20 MRNMWVVIKETYLRHVESWSFFFMVISPFLFLGISVGIGHLQGSSMAKNNKVAVVTTVPSVAEGLKNVNGVNFDYKDE 
ASAKEAIKEEKLKGYLTIDQEDSVLKAVYHGETSLENGIKFEVTGTLNELQNQLNRSTASLSQEQEKRLAQTIQFTEKIDE 
AKE^KFIQTIAAGALGFFLYMILITYAGVTAQEVASEKGTKIMEVVFSSIRASHYFYARMMALFLVILTHIGIYVVGGL 
AAVLLFKDLPFLAQSGILDHLGDAISLNTLLFILISLFMYVVLAAFLGSMVSRPEDSGKALSPLMILIMGGFFGVTALGAA 
GDNLLLKIGSYIPFISTFFMPFRTINDYAGGAEAWISLAITVIFAVVATGFIGRMYASLVLQTDDLGIWKTFKRALSYKZ 

25 

MTETIKLMKAHTSVRRFKEQEIPQVDLNEILTAAQMASSWKNFQSYSVIVVRSQEKKDALYELVPQEAIRQSAVFLLFV 
GDLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVIIGLVRYKSEEVAELFNLPDYTYSVFG 
MALGVPNQHHDMKPRLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK 
KLLZMLKLIAIVGTNSKRSTNRQLLQYMQKHFTDKAEIELVEIKAIPVFNKPADKQVPAEILEIAAKIEEADGVIIGTPEYD 
30 HSIPAVLMSALAWLSYGIYPLLNKPIMITGASYGTLGSSRAQLQLRQILNAPEIKANVLPDEFLLSHSLQAFNPSGDLVDL 
DVIKKLDAIFDDFRIFVKITEKLRNAQELLRKDAEDFDWENLZ 

MOTYQLNNGVEIPVLGFGTFKAKDGEEAYRAVLEALKAGYRHIDTAAIYQNEESVGQAIKDSGVPREEMFVTTKLWNS 
QQTYEQTRQALEKSIEKLGLDYLDLYLIHWPNPKPLRENDAWKTRNAEVWRAMEDLYQEGKIRAIGVSNFLPHHLDAL 
35 LETATIVPAVNQVRLAPGVYQDQVVAYCREKGILLEAWGPFGQGELFDSKQVQEIAANHGKSVAQIALAWSLAEGFLP 
LPKSVTTSRIQANLDCFGIELSHEERETLKTIAVQSGAPRVDDVDFZ 

MRCKMLDPIAIQLGPLAIRWYALCIVTGHLAVYLTMKEAPRKKIIPDDILDFILVAFPLAILGARLYYVIFRFDYYSQNLG 
EIFAIWNGGLAIYGGLITGALVLYIFADRKLINTWDFLDIAAPSVMIAQSLGRWGNFFNQEAYGATVDNLDYLPGFIRDQ 
40 MYIEGSYRQPTFLYESLWNLLGFAULIFRRKWKSLRRGHITAFYLIWYGFGRMVIEGMRTDSLMFFGFRVSQWLSVVLI 
GLGIMIVIYQNRKKAPYYITEEENZ 

MGKLSSILLGTVSGAALALFLTSDKGKQVCSQAQDFLDDLREDPEYAKEQVCEKLTEVKEQATDFVLKTKEQVESGEIT 
VDSILAQTKSYAFQATEASKNQLNNLKEQWQEKAEALDDSEEIVIDITEEZ 

45 

MKTKLIFWGSMLFLLSLSILLTIYLAWIFYPMEIQWLNLTNRVYLKPETIQYNFHILMNYLTNPFSQVLQMPDFRSSAAG 
LHHFAVVKNLFHLVQLVALVTLPSFYVFVNRIVKKDFLSLYRKSLLALVVLPVMIGLGGVLIGFDQFFTLFHQILFVGD 
DTWLFDPAKDPVIMILPETFFLHAFLLFFALYENFFGYLYLKSRRKZ 

50 MTYHFTEEYDIIVIGAGHAGVEASLAASRMGCKVLLATINIEMLAFMPCNPSIGGSAKGIVVREVDALGGEMAKTIDKT 
YIQMKMLNTGKGPAVRALRAQADKELYSKEMRKTVENQENLTLRQTMIDEILVEDGKVVGVRTATHQEYAAKAVIVT 
TGTALRGEIIIGDLKYSSGPNHSLASINLADNLKELGLEIGRFKTGTPPRVKASSINYDVTEIQPGDEVPNHFSYTSRDEDY 
VKDQVPCWLTYTNGTSHEIIQNNLHRAPMFTGVVKGVGPRYCPSIEDKIVRFADKERHQLFLEPEGRNTEEVYVQGLST 
SLPEDVQRDLVHSIKGLENAEMMRTGYAIEYDMVLPHQLRATLETKKISGLFTAGQTNGTSGYEEAAGQGIIAGINAAL 

55 KIQGKPELILKRSDGYIGVMIDDLVTKGTIEPYRLLTSRAEYRLILRHDNADMRLTEMGREIGLVDDERWARFEIKKNQF 
DNEMKRLDSIKLKPVKETNAKVEEMGFKPLTDAVTAKEFLRRPEVSYQDVVAFIGPAAEDLDDKIIELIETEIKYEGYISK 
AMDQVAKMKRMEEKRIPANIDWDDIDSIATEARQKFKLINPETIGQASRISGVNPADISILMVYLEGKNRSISKTLQKSKZ 

MTKQVLLVDDEEHILKLLDYHLSKEGFSTQLVTNGRKALALAETEPFDFILLDIMLPQLDGMEVCKRLRAKGVKTPIM 
60 MVSAKSDEFDKVLALELGADDYLTKPFSPRELLARVKAVLRRTKGEQEGDDSDNIADDSWLFGTLKVYPERHEVYKA 
NKLLS LTPKEF ES D K N P FFE V FK VS K VT AQ Z 

MTTFKDGFLWGGAVAAHQLEGGWQEGGKGISVADVMTAGRHGVAREITLGVLEGKYYPNHEAIDFYHRYKEDIALF 
AEMGFKCFRTSIAWTRIFPKGDELEPNEEGLQFYDNLFDECLKNGIEPVITLSHFEMPYHLVTEYGGWKNRKLIDFFARF 
65 AEVVFKRYKDKVKYWMTFNEINNQANYQEDFAPFTNSGIVYEEGDNREAIMYQAAHYELVASARAVKIGHEINPDFQI 
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GCMIAMCPIYPVTCNPKDILMAMKAMQKRYYFADVHVLGKYPEHIFKYWERKGISVDFTAQDKEDLLGGTVDYIGFS 
YYMSFAIDSHRENNPYFDYLETEDLVKNNYVKASEWEWQIDPEGLRYALNWFTDHYHLPLFIVENGFGAIDQVAADG 
MVHDDYRIEYLGAHIREMKKAVVEDGVDLMGYTPWGCIDLVSAGTGEMRKRYGFIYVDKDDNGKGSYNRSPKKSFG 
WYKEVISSNGESVEZ 

5 

MDQQNGLFGFLENHVMGPMGKLAQFKVVRAITAAGMAAVPFTIVGSMFLVFSILPQAFSFWPIVADIFSASFDKFTSLY 
MVANYATMGSLSLYFVLSLAYELTKIYAEEEELNMNPLNGALLALMAFVMTVPQIIFDGGMMKTVTSLKEGAVIADG 
WAMGm^VARFGTTGIFTAIIMAIVTVLIYmCVKHNWVIKMPEAVPEGVSRGFTALVPGFVVAFVVIFINGLLVAMGT 
10 DIFKVIAIPFGFVSNLTNSWIGLMIIYLLTQLLWIVGIHGANIVFAFVSPIALANMAENAAGGHFAVAGEFSNMFVIAGGS 
GATLGLCLYIAFASKSEQLKAIGRASVVPALFNINEPLIFGLPIIYNPALAIPFILAPMVTATIYYVANSLNFIKPIIAQVPWP 
TPVGIGAFLGTADLRAVLVALVCAFAAFLVYLPFIRVYDQKLVKEEQGIZ 

MKKFYVSPIFPILVGLIAFGVLSTFUFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLEQ 
15 MPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYFVDVSR 
EQAITDELVTSRPVIGIVSVDNYDDLEDETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRFYLFTDYTVLEGLMNDK 
FSVIDAFREESKQRQLPLTLSMGFSYGDGNHDEIGKVALLNLNLAEVRGGDQVVVKENDETKNPVYFGGGSAASIKRT 
RTRTRAMMTAISDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSFIEKEGVTK 
LLSVKDAMGMVTNRSLLILVDHSKTALTLSKEFYDLFTQTIVIDHHRRDQDFPDNAVITYIESGASSASELVTELIQFQNS 
20 KKNRLSRMQASVLMAGMMLDTKNFTSRVTSRTFDVASYLRTRGSDSIAIQEIAATDFEEYREVNELILQGRKLGSDVLI 
AEAKDMKCYDTVVISKAADAMLAMSGIEASFVLAKNTQGFISISARSRSKLNVQRIMEELGGGGHFNLAAAQIKDVTLS 
EAGEKLTEIVLNEMKEKEKEEZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLSIVVLALFIMGARKTKLASFNFSFF 
25 RAKDLARLGLSYLVIVGSNILGSILLQLSNETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPFCKIFRGKENL 
GFVVGTIVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLSIVVLALFIMGARKTKLASFNFSFF 
RAKDLARLGLSYLVIVGSNILGSILLQLSNETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKIFRGKENL 
30 GFVVGTrVFALLHQPSNLPSLLIYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MDTQKIEAAVKMIIEAVGEDANREGLQETPARVARMYQEIFSGLGQTAEEHLSKSFEIIDDNMVVEKDIFFHTMCEHHF 
LPFYGRAHIAYIPDGRVAGLSKLARTVEVYSKKPQIQERLNIEVADALMDYLGAKGAFVVIEAEHMCMSMRGVRKPGT 
ATLTTVARGLFETDKDLRDQAYRLMGLZMKDLFLKRKQAFRKECLGYLRYVLNDHFVLFLLVLLGFLAYQYSQLLQH 
35 FPENHWPILLFVGITSVLLLLWGGTATYMEAPDKLFLLVGEEEIKLHLKRQTGISLVFWLFVQTLFLLLFAPLFLAMGY 
GLPVFLLYVLLLGVGKYFHFCQKASKFFTETGLDWDYVISQESKRKQVLLRFFALFTQVKGISNSVKRRAYLDFILKAV 
QKVPGKIWQNLYLRSYLRNGDLFALSLRLLLLSLLAQVFIEQAWIATAVVVLFNYLLLFQLLALYHAFDYQYLTQLFPL 
DKGQKEKGLQEVVRGLTSFVLLVELVVGLITFQEKLALLALLGAGLVLLVLYLPYQVKRQMQDZ 

40 MRKSIVLAADNAYLIPLETTIKSVLYHNRDVDFYILNSDIAPEWFKLLGRKMEVVNSTIRSVHIDKELFESYKTGPHINYA 
SYFRFFATEVVESDRVLYLDSDIIVTGELATLFEIDLKGYSIGAVDDVYAYEGRKSGFNTGMLLMDVAKWKEHSIVNSL 
LELAAEQNQVVHLGDQSILNIYFEDNWLALDKTYNYMVGIDIYHLAQECERLDDNPPTIVHYASHDKPWNTYSISRLRE 
LWWVYRDLDWSEIAFQRSDLNYFERSNQSKKQVMLVTWSADIKHLEYLVQRLPDWHFHLAAPCDCSEELTSLSQYTN 
VTVYQNVLHSRIDWLLDDSIVYLDINTGGEVFNVVTRAQESGKKIFAFDITRKSMDDGLYDGIFSVERPDDLVDRMKNI 

45 EIEZ 

MTKIYSSIAVKKGLFTSFLLFIYVLGSRIILPFVDLNTKDFLGGSTAYLAFSAALTGGNLRSLSIFSVGLSPWMSAMILWQ 
MFSFSKRLGLTSTSIEIQDRRKMYLTLLIAVIQSLAVSLRLPVQSSYSAILVVLMNTILLIAGTFFLVWLSDLNASMGIGGSI 
VILLSSMVLNIPQDVLETFQTVHIPTGIIVLLALLTLVFSYLLALMYRARYLVPVNKIGLHNRFKRYSYLEIMLNPAGGMP 
50 YMYVMSFLSVPAYLFILLGFIFPNHSGLAALSKEFMVGKPLWVYVYISVLFLFSIIFAFVTMNGEEIADRMKKSGEYIYGI 
YPGADTSRFINRLVLRFSVIGGLFNVIMAGGPMLFVLFDEKLLRLAMIPGLFMMFGGMIFTIRDEVKALRLNETYRPLIZ 

MSSLSDQELVAKTVEFRQRLSEGESLDDILVEAFAVVREADKR1LGMFPYDVQVMGAIVMHYGNVAEMNTGEGKTLT 
ATMPVYLNAFSGEGVMVVTPNEYLSKRDAEEMGQVYRFLGLTIGVPFTEDPKKEMKAEEKKLIYASDIIYTTNSNLGFD 

55 YLNDNLASNEEGKFLRPFNYVIIDEIDDILLDSAQTPLIIAGSPRVQSNYYAIIDTLVTTLVEGEDYIFKEEKEEVWLTTKG 
AKSAENFLGIDNLYKEEHASFARHLVYAIRAHKLFTKDKDYIIRGNEMVLVDKGTGRLMEMTKLQGGLHQAIEAKEHV 
KLSPETRAMASITYQSLFKMFNKISGMTGTGKVAEKEFIETYNMSVVRIPTNRPRQRIDYPDNLYITLPEKVYASLEYIKQ 
YHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGKGVAEL 
GGLIVIGTERMESQRIDLQIRGRSGRQGDPGMSKFFVSLEDDVIKKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRKL 

60 VEKAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHFIVTN 
ISFHVKEVPDYIDVTDKTAVRSFMKQVIDKELSEKKELLNQHDLYEQFLRLSLLKAIDDNWVEQVDYLQQLSMAIGGQS 
ASQKNPIVEYYQEAYAGFEAMKEQIHADMVRNLLMGLVEVTPKGEIVTHFPZ 

MIGTFAAALVAVLANFIVPIEITPNSANTEIAPPDGIGQVLSNLLLKLVDNPVNALLTANYIRILSWAVIFGIAMREASKNS 
65 QELLKTIADVTSKIVEWHNLAPFGILGLVFKTISDKGVGSLANYGILLVLLVTTMLFVAPVVNPLIAFFFMRRNPYPLVW 
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NCLRVSGVTAFFTRSSATNIPVNMKLCHDLGLNPDTYSVSIPLGSTINMAGVAITINLLTLAAVNTLGIPVDFATAFVLSV 
VAAISSCDASGIAGGSLLLIPVACSLFGISNDIAIQIVGVGFVIGVIQDSCETALNSSTDVLFTAVAEYAATRKKZ 



5 MSISQRTTKLILATCLACLLAYFLNLSSAVSAGIIALI^I^DTRRSTLKLARNRLFSMLLALAIGVLAFHLSGFHIWSLGLY 
LAFYVPLAYKMGWEIGITPSTVLVSHLLVQESTSPDLLVNEFLLFAIGTGFALLVNLYMPSREEEIQHYHTLVEEKLKDI 
LQRFKYYLSRGDGRNRAQLVAELDTLLKEALRLVYLDHSDHLFHQTDYHIHYFEMRQRQSRILRNMAQQINTCHLAAS 
ESLILAQLFSKIAGQLSQTNPASDLLDEIERYLEVFRNRSLPKTREEFETRATLLQLLREAKTFIQVKVDFYQKYRQZ 

10 MEIMSLAIAVFAVIIGLVIGYVSISAKMKSSQEAAELMLLNAEQEATNLRGQAEREADLLVNEAKRESKSLKKEALLEAK 
EEARKYREEVDAEFKSERQELKQIESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAKNLDAREEQLEEVERQKEAE 
LERIGALSQAEARDIILAQTEENLTREIASRIREAEQEVKERSDKMAKDILVQAMQRIAGEYVAESTNSTVHLPDDTMKG 
RIIGREGRNIRTFESLTGVDVIIDDTPEVVTLSGFDPIRREIARMTMEMLLKDGRIHPARIEELVEKNRQEIDNKIREYGEA 
AAYEIGAPNLHPDLMKIMGRLQFRTSYGQNVLRHSIEVAKLAGIMASELGENAALARRAGFLHDIGKAIDHEVEGSHVE 

15 IGMELARKYKEPPVVVNTIASHHGDVEAESVIAVIVAAADALSAARPGARSESLESYIKRLHDLEEIANGFEGVQTSFAL 
QAGREIRIMVNPGKIKDDKVTILAHKVRKKIENNLDYPGNIKVTVIRELRAVDYAKZ 

MMLKPSIDTLLDKVPSKYSLVILEAKRAHELEAGAPATQGFKSEKSTLRALEEIESGNVTIHPDPEGKREAVRRRIEEEKR 
^ RKEEEEKKKEQIAKEKEDGEKIZ 

MSAYQLPTVWQDEASNQGAFTGLNRPTAGARFEQNLPKGEQAFQLYSLGTPNGVKVTILLEELLEAGFKEAAYDLYKI 
AIMDGDQFGSDFVKLNPNSKIPALLDQSGTENVRVFESAHILLYLAEKFGAFLPSNPVEKVEVLNWLFWQAGAAPFLG 
GGFGHFFNYAPEKLEYPINRFTMEVKRQLDLLDKELAQKPYIAGNDYTIADIAIWSWYGQLVQGNLYQGSAKFLDASS 
^ YQNLVKWAEKIANRPAVKRGLEVTYTEIKZ 

LASLITSIIMFYVGFDVLRDTIQKILSREETVIDPLGATLGIISAAIMFVVYL^ 

GTAIAILASSFNYPIVDKLVAIIITFFILKTAYDIFIESSFSLSDGFDDRLLEDYQKAIMEIPKISKVKSQRGRTYGSNIYLDIT 
LEMNPDLSVFESHEIADQVESMLEERFGVFDTDVHIEPAPIPEDEILDNVYKKLLMREQLIDQGNQLEELLTDDFVYIRQ 
DGEQMDKEAYKTKKELNSAIKDIQITSISQKTKLICYELDGIIHTSIWRRHETWQNIFHQETKKEZ 

30 

MTIKLVATDMDGTFLDGNGRFDMDRLKSLLVSYKEKGIYFAVASGRGFLSLEKLFAGVRDDIIFIAENGSLVEYQGQDL 
YEATMSRDFYLATFEKLKTSPYVDINKLLLTGKKGSYVLDTVDETYLKVSQHYNENIQKVASLEDITDDIFKFTTNFTEE 
TLEDGEAWVNENVPGVKAMTTGFESIDIVLDYVDKGVAIVELVKKLGITMDQVMAFGDNLNDLHMMQVVGHPVAPE 
NARPEILELAKTVIGHHKERSVIAYMEGLZ 

35 

MADIKLIALDLDGTLLTTDKRLTDRTKETLQAARDRGIKVVLTTGRPLKAMDFFLHELGTDGQEDEYTITFNGGLVQK 
NTGEILDKTVFSYDDVARLYEETEKLSLPLDAISEGTVYQIQSDQESLYAKFNPALTFVPVDFEDLSSQMTYNKCVTAFA 
QEPLDAAIQKISPELFDQYEIFKSREMLLEWSPKNVHKATGLAKLISHLGIDQSQVMACGDEANDLSMIEWAGLGVAM 
QNAVPEVKAAANVVTPMTNDEEAVAWAIEEYVLKENZ 

MESLLILLLIANLAGLFLIWQRQDRQEKHLSKSLEDQADHLSDQLDYRFDQARQASQLDQKDLEVVVSDRLQEVRIELH 
QGLTQVRQEMTDNLLQTRDKTDQRLQALQESNEQRLEQMRQTVEEKLEKTLQTRLQASFETVSKQLESVNRGLGEMQ 
TVARDVGALNKVLSGTKTRGILGELQLGQIIEDIMTPAQYEREYATVENSSERVEYAIKLPGQGDQEYVYLPIDSKFPLA 
DYYRLEEAYETGDKDEIERCRKSLLASVKRFARDIRNKYIAPPRTTNFGVLFVPTEGLYSEIVRNPVFFDDLRREEQIIVA 
45 GPSTLSALLNSLSVGFKTLNIQKSADHISKTLASVKTEFGKFGGILVKAQKHLQHASGNIDELLNRRTIAIERTLRHIELSE 
GEPALDLLHFQENEEEYEDZ 

MKISHMKKDELFEGFYLIKSADLRQTRAGKNYLAFTFQDDSGEIDGKLWDAQPHNIEAFTAGKVVHMKGRREVYNNT 
PQVNQITLRLPQAGEPNDPADFKVKSPVDVKEIRDYMSQMIFKIENPVWQRIVRNLYTKYDKEFYSYPAAKTNHHAFET 
50 GLAYHTATMVRLADAISEVYPQLNKSLLYAGIMLHDLAKVIELTGPDQTEYTVRGNLLGHIALIDSEITKTVMELGIDDT 
KEEVVLLRHVILSHHGLLEYGSPVRPRiMEAEIIHMIDNLDASMMMMSTALALVDKGEMTNKIFAMDNRSFYKPDLDZ 

MSEKLAKKGFKMPSSYTVLLIIIAIMAVLTWFIPAGAFIEGIYETQPQNPQGIWDVLMAPIRAMLGTHPEEGSLIKETSAAID 
VAFFILMVGGFLGIVNKTGALDVGIASIVKKYKGREKMLILVLMPLFALGGTTYGMGEETMAFYPLLVPVMMAVGFDS 
55 LTGVAIILLGSQIGCLASTLNPFATGIASATAGVGTGDGIVLRLIFWVTLTALSTWFVYRYADKIQKDPTKSLVYSTRKED 
LKHFNVEESSSVESTLSSKQKSVLFLFVLTFILMVLSFIPWTDLGVTIFDDFNTWLTGLPVIGNIVGSSTSALGTWYFPEG 
AMLFAFMGILIGVIYGLKEDKIISSFMNGAADLLSVALIVAIARGIQVIMNDGMITDTILNWGKEGLSGLSSQVFIVVTYIF 
YLPMSFLIPSSSGLASATMGIMAPLGEFVNVRPSLIITAYQSASGVLNLIAPTSGIVMGALALGRINIGTWWKFMGKLVVA 
IIVVTIALLLLGTFLPFLZ 

60 

MSNSFVKLLVSQLFANLADIFFRVTIIANIYIISKSVIATSLVPILIGISSFVASLLVPLVTKRLALNRVLSLSQFGKTILLAIL 
VGMFTVMQSVAPLVTYLFVVAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAVQLIGWGLGGLLFATIGLLPTT 
CINLVLYIISSFLMLFLPNAEVEVLESETNLEILLKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESY 
WGYSNTAYSIGIIISGLIAFRLSEKFLAAKWEPQLFTPNLKTIQNPCLSLDPGWFLFSPNGCFLLDKKEFPLYGISVEKNTK 
65 RKETHMNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLKLKER1SKYLVTNDQRRYCRYSDSDILVQFLFQLL 
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TGYGTDYACKELSADAYFPKLLEGGQLASQPTLSRFLSRTDEETVHSLRCLNLELVEFFLQFHQLNQLIVDIDSTHFTTY 
GKQEGVAYNAHYRAHGYHPLYAFEGKTGYCFNAQLRPGNRYCSEEADSFITPVLERFNQLLFRMDSGFATPKLYDLIE 
KTGQYYLIKLKKNTVLSRLGDLSLPCPQDEDLTILPHSAYSETLYQAGSWSHKRRVCQFSERKEGNLFYDVISLVTNMTS 
GTSQDQFQLYRGRGQAENFIKEMKEGFFGDKTDSSTLIKNEVRMMMSCIAYNLYLFLKHLAGGDFQTLTDCRFRHLFL 
5 HVVGKCVRTGRKQLLKLSSLYAYSELFSALYSRIRKVNLNLPVPYEPPRRKASLMMHZ 

MMEFFQQLPHLEPYGNPQYFVYVIAATLPIFIGLFFKKRFAWYEVLVSLFFIVTMLVGGKTNQLAALGIYLCWEILLLLF 
YKHYRKSKDGKWVFYLVSFLSLLPIIFVKVQPAINGTQSLLGFLGISYLTFRSVGIVIELRDGVIKDFTLWEFLRFLLFMPT 
FSSGPIDRFKRFNENYQAIPERDELMDMLDESVRYIMWGFLYKFILAHVLGETLLPPLKNLALQSGGFFNLYALAVMYT 
10 FGLELFFDFAGYSMFALAISNLMGIRSPINFNKPFLSRDLKEFWNRWHMSLSFWFRDFVFMRMVMVLTRKKVFKNRN 
VTSSMAYIVNMLIMGFWHGVTWYYIAYGLFHGLGLVINDAWVRKKKTLNKERXKAGKAALPENRWIQLLGMVVTFH 
VVMLSFLIFSGFLNNLWFKKZ 

MLKRLWMIFGPVLIAGLLVFLLIFFYPTEMHHNLGAEKRSAVATTIDSFKERSQKVRALSDPNVRFVPFFGSSEWLRFD 
15 GAHPAVLAEKYNRSYRPYLLGQGGAASLNQYFGMQQMLPQLENKQVVYVISPQWFSKNGYDPAAFQQYFNGDQLTS 
FLKHQSGDQASQYAATRLLQQFPNVAMKDLVQKLASKEELSTADNEMIELLARFNERQASFFGQFSVRGYVNYDKHV 
AKYLKILPDQFSYQAIEDVVKADAEKNTSNNEMGMENYFYNEQIKKDLKKLKDSQKSFTYLKSPEYNDLQLVLTQFSK 
SKVNPIFIIPPVNKKWMNYAGLREDMYQQTVQKIRYQLESQGFTNIADFSKDGGEPFFMKDTIHLGWLGWLAFDKAVD 
PFLSNPTPAPTYHLNERFFSKDWATYDGDVKEFQZ 

20 

MEKNLKALKQTTDQEGPAIEPEKAEDTKTVQNGYFEDAAVKDRTLSDYAGNWQSVYPFLEDGTFDQVFDYKAKLTG 
KMTQAEYKAYYTKGYHTDVTKINITDNTMEFVQGGQSKKYTYKYVGKKILTYKKGNRGVRFLFEATDADAGQFKYV 
QFSDHNVAPVKAEHFHIFFGGTSQEALFEEMDNWPTYYPDNLSGQEIAQEMLAHZ 

25 MKDGHLLAHHIRLLNGRIFQKLLSQDPEALYRGEQGKILAVLWNSETGCATATDIALATGLANNTLTTMIKKLEEQKL 
VIVSPCGKDKRKKYLVLTELGKSQKEVGHRVSQKLDTIFYKGFSEEEIHQFEGFQER1LANLKEKGNEVZ 

MTNLIATFQDRFSDWLTALSQHLQLSLLTLLLAILLAIPLAVFLRYHEKLADWVLQIAGIFQTIPSLALLGLFIPLMGIGTL 
PALTALVIYAIFPILQNTITGLKGIDPNLQEAGIAFGMTRWERLKKFEIPLAMPVIMSGIRTAAVLIIGTATLAALIGAGGL 
30 GSFILLGIDRNNASLILIGALSSAVLAIAFNFLLKVMEKAKLRTIFSGFALVALLLGLSYSPALLVQKEKENLVIAGKIGPEP 
EILANMYKLLIEENTSMTATVKPNFGKTSFLYEALKKGDIDIYPEFTGTVTESLLQPSPKVSHEPEQVYQVARDGIAKQD 
HLAYLKPMSYQNTYAVAVPKKIAQEYGLKTISDLKKVEGQLKAGFTLEFNDREDGNKGLQSMYGLNLNVATIEPALRY 
QAIQSGDIQITDAYSTDAELERYDLQVLEDDKQLFPPYQGAPLMKEALLKKHPELERVLNTLAGKITESQMSQLNYQVG 
VEGKSAKQVAKEFLQEQGLLKKZ 

35 

MMHTYLQKKIENIKTTLGEMSGGYRRMVAAMADLGFSGTMKAIWDDLFAHRSFAQWIYLLVLGSFPLWLELVYEHRI 
VDWIGMICSLTGIICVIFVSEGRASNYLFGLINSVIYLILALQKGFYGEVLTTLYFTVMQPIGLLVWIYQAQFBCKEKQEFV 
ARKLDGKGWTKYLSISVLWWLAFGFIYQSIGANRPYRDSITDATNGVGQILMTAVYREQWIFWAATNVFSIYLWWGES 
LQIQGKYLIYLINSLVGWYQWSKAAKQNTDLLNZ 

40 

MRNMKAKYAVWVAFFLNLTYAIVEFIAGGVFGSSAVLADSVHDLGDAIAIGISAFLETISNREEDNQYTLGYKRFSLLG 
ALVTAVILVTGSVLVILENVTKILHPQPVNDEGILWLGIIAITINLLASLVVGKGKTKNESILSLHFLEDTLGWVAVILMAI 
VLRFTDWYILDPLLSLVISFFILSKALPRFWSTLKIFLDAVPEGLDIKQVKSGLERLDNVASLNQLNLWTMDALEKNAIV 
HVCLKEMEHMETCKESIRIFLKDCGFQNITIEIDADLETHQTHKRKVCDLERSYEHQHZ 

MIEYKNVALRYTEKDVLRDVNLQIEDGEFMVLVGPSGSGKTTMLKMINRLLEPTDGNIYMDGKRIKDYDERELRLSTG 
YVLQA1ALFPNLTVAENIALIPEMKGWSKEEITKKTEELLAKVGLPVAEYGHRLPSELSGGEQQRVGIVRAMIGQPKIFL 
MDEPFSALDAISRKQLQVLTKELHKEFGMTTIFVTHDTDEALKLADRIAVLQDGEIRQVANPETILKAPATDFVADLFG 
GSVHDZ 

50 

MSAVAISAMTKVMQETHGNPSSIHGHGRQAGKLLREARQELAQLLRTKPQHIFFTSGGTEGNNTTIIGYCLRHQEQGKH 
IITTAIEHHAVLETIDYLVQHFGFEATIIQPENQEITAQQIQKALRDDTILVSTMFVNNETGNLLPIAEIGQILKQHPAAYH 
VDAVQAIGKIPIHSEELGIDFLTASAHKFHGPKGIGFLYASSMDFDSYLHGGDQEQKKRAGTENLPAIVGMVAALKEDL 
EKQEEHFQHVQNLETAFLAELEGIQYYLNRGKHHLPYVLNIGFPGQKNDLLLLRLDLAGISISTGSACTAGVVQSSHVLE 
55 AM YG ANS ERLKES LRIS LS PQ NT V EDLQTLAKTLKEIIGGZ 



MLFKLSKEKIELGLSRLSPARRIFLSFALVILLGSLLLSLPFVQVESSRATYFDHLFTAVSAVCVTGLSTLPVAHTYNIWG 
QIICLLLIQIGGLGLMTFIGVFYIQSKQKLSLRSRATIQDSFSYGETRSLRKFVYSIFLTTFLVESLGAILLSFRLIPQLGWGR 
60 GLFSSIFLAISAFCNAGFDNLGSTSLFAFQTDLLVNLVIAGLIITGGLGFMVWFDLAGHVGRKKKGRLHFHTKLVLLLTI 
GLLLFGTATTLFLEWNNAGTIGNLPVADKVLVSFFQTVTMRTAGFSTIDYTQAHPVTLLIYILQMFLGGAPGGTAGGLK 
ITTFFVLLVFARSELLGLPHANVARRTIAPRTVQKSFSVFIIFLMSFLIGLILLGITAKGNPPFIHLVFETISALSTVGVTANL 
TPDLGKLALSVIMPLMFMGRIGPLTLFVSLADYHPEKKDMIHYMKADISIGZ 

65 
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MSDRTIGILGLGIFGSSVLAALAKQDMNIIAIDDHAERINQFEPVLARGVIGDITDEELLRSAGIDTCDTVVVATGENLESS 
VLAVMHCKSLGVPTVIAKVKSQTAKKVLEKIGADSVISPEYEMGQSLAQTILFHNSVDVFQLDKNVSIVEMKIPQSWAG 
QSLSKLDLRGKYNLNILGFREQENSPLDVEFGPDDLLKADTYILAVINNQYLDTLVALNSZ 

5 

MKLLSIAISSYNAAAYLHYCVESLVIGGEQVGILIINDGSQDQTQEIAECLASKYPNIVRAIYQENKCHGGAVNRGLVEAS 

GRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSK 

GQYTMMHSLIYRTDLLRASQFZ 

10 

MKFNPNQRYTRWSIRRLSVGVASVVVASGFFVLVGQPSSVRADGLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKP 
EGVTGNTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKEN 
TKDSIDVPAAYLEKAEGKGPFTAGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYE 
VDLNGNTVGKQGQALIDQLRANGTQTYKATVKVYGNKDGKADLTNLVATKNVDININGLVAKETVQKAVADNVKDS 
15 IDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQV 
ALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKDSIDV 
PAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGENVKTKGQYFYQLALD 
GNVAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNIVATKKVTININGLISKETVQKAVADNVKTVSMFQQP 
TZ 

20 

MKLKSYILVGYIISTLLTILVVFWAVQKMLIAKGEIYFLLGMTIVASLVGAGISLFLLLPVFTSLGKLKEHAKRVAAKDFP 
SNLEVQGPVEFQQLGQTFNEMSHDLQVSFDSLEESEREKGLMIAQLSHDIKTPITSIQATVEGILDGIIKESEQAHYLATIG 
RQTERLNKLVEELNFLTLNTARNQVETTSKDSIFLDKLLIECMSEFQFLIEQERRDVHLQVIPESARIEGDYAKLSR1LVN 
25 LVDNAFKYSAPGTKLEVVAKLEKDQLSISVTDEGQGIAPEDLENIFKRLYRVETSRNMKTGGHGLGLAIARELAHQLGG 
EITVSSQYGLGSTFTLVLNLSGSENKAZ 

MFGQTAQHGLTNSLKDFWIFLLNIGPQLAFFCQMLRCSRSVEQGTGNHRREFNMIQQIFSHFGMTHLGQIKLVYQESID 
30 LELLVNALNHHLLIDRLVLTPNQITIEIDRQIVHGLDLLKGRKDKEIIDIKSMFRQLELASTQQICPINQRVHHGILAFGEIS 
DLVPAKNLPNRQDZ 

MEHLATYFSTYGGAFFAALGIVLAVGLSGMGSAYGVGKAGQSAAALLKEQPEKFASAULQLLPGTQGLYGFVIGILIW 
35 LQLTPELPLEKGVAYFFVALPIAIVGYFSAKHQGNVAVAGMQILAKRPKEFMKGAILAAMVETYAILAFVVSFILTLRVZ 

MLKSEKQSRYQMLNEELSFLLEGETNVLANLSNASALIKSRFPNTVFAGFYLFDGKELVLGPFQGGVSCIR1ALGKGVC 
GEAAHFQETVIVGDVTTYLNYISCDSLAKSEIVVPMMKNGQLLGVLDLDSSEIEDYDAMDRDYLEQFVAILLEKTAWD 
40 FTMFEEKSZ 

MSVLEIKDLHVEIEGKEILKGVNLTLKTGEIAAIMGPNGTGKSTLSAAIMGNPNYEVTKGEVLFDGVNILELEVDERAR 
MGLFLAMQYPSEIPGITNAEFLRAAMNAGKEDDEKISVREFITKLDEKMELLNMKEEMAERYLNEGFSGGEKKRNEIL 
45 QLLMLEPTFALLDEIDSGLDIDALKVVSKGVNAMRGEGFGAMIITHYQRLLNYITPDVVHVMMEGRVVLSGGPELAAR 
LEREG Y AKL AEELG Y D YKEELZ 

MPYKRQRSFSMALSKLDSLYMAVVADHSKNPHHQGKLEDAEQISLNNPTCGDVINLSVKFDAEDRLEDIAFLNSGCTIS 
50 TASASMMTDAVLGKTKQEILELATIFSEMVQGQKDERQDQLGDAAFLSGVAKFPQRIKCATLAWNALKKTIENQEKQZ 

MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTDHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATV 
LFAKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMKDGFADKLRQATSADQVIELFDQASEKTEEL 

55 VQAPANDSGDFIVAVTACTTGIAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAIIIAADKAVEM 
DRFDGKPLINRPVADGIRKTEELINLALSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVSQMLPFVIGGGIMI 
ALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIP 
YAAGGEATSTLAGVSSGFLGALVGGFIAGALVLAIKKYVKVPRSLEGAKSILLLPLLGTILTGFVMLAVNIPMAAINTAM 
NDFLGGLGGGSAVLLGIVLGGMMAVDMGGPVNKAAYVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLF 

60 KDKFTKEERNSGLTNIIMGLSFITEGAIPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVIALTSNALLYLVS 
VLVGAIVSGVVYGYLRKPQAZ 

MANKNTSTTRRRPSKAELERKEAIQRMLISLGIAILLIFAAFKLGAAGITLYNLIRLLVGSLAYLAIFGLLIYLFFFKWIRK 
65 QEGLLSGFFTIFAGLLLIFEAYLVWKYGLDKSVLKGTMAQVVTDLTGFRTTSFAGGGLIGVALYIPTAFLFSNIGTYFIGS 
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ILILVGSLLVSPWSVYDIAEFFSRGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQEETEKALLDLPPVDMETGE 
ILTEEAVQNLPPIPEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKIVRE 
NIKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNLSDDLALALAAKDVRIEAPIPGKSLIGIEVPNSDIATV 
SFRELWEQSQTKAENFLEIPLGKAVNGTARAFDLSKMPHLLVAGSTGSGKSVAVNGIIASILMKARPDQVKFMMVDPK 
5 MVELSVYNDIPHLLIPVVTNPRKASKALQKVVDEMENRYELFAKVGVRNIAGFNAKVEEFNSQSEYKQIPLPFIVVIVDE 
LADLMMVASKEVEDAIIRLGQKARAAGIHMILATQRPSVDVISGLIKANVPSRVAFAVSSGTDSRTILDENGAEKLLGRG 
DMLFKPIDENHPVRLQGSFISDDDVER1VNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKA 
SASMIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQZ 

10 

MSYFKKYKFDKSQFKLGMRTFKTGIAVFLVLLIFGFFGWKGLQIGALTAVFSLRESFDESVHFGTSRILGNSIGGLYALV 
FFLLNTFFHEAF\WTLVVVPICTMLTIMTNVAMNNKAGVIGGVAAMLIITLSIPSGETILYVFVRVLETFMGVFVAIIVN 
YDIDRIRLFLEKKEKZ 

15 

MNKSEHRHQLIRALITKNKIHTQAELQALLAENDIQVTQATLSRDIKNMNLSKVREEDSAYYVLNNGSISKWEKRLELY 
MEDALVWMRPVQHQVLLKTLPGLAQSFGSIIDTLSFPDAIATLCGNDVCLIICEDADTAQKCFEELKKFAPPFFFEEZ 



20 MKSIKLNALSYMGIRVLNIIFPILTGTYVARVLDRTDYGYFNSVDTILSFFLPFATYGVYNYGLRAISNVKDNKKDLNRT 
FSSLFYLCIACTILTTAVYILAYPLFFTDNPIVKKVYLVMGIQLIAQIFSIEWVNEALENYSFLFYKTAFIRILMLVSIFLFVK 
NEHDIVVYTLVMSLSTLINYUSYFWIKRDIKLVKIHLSDFKPLFLPLTAMLVFANANMLFTFLDRLFLVKTGIDVNVSY 
YTIAQR1VTVIAGVVTGAIGVSVPRLSYYLGKGDKEAYVSLVNRGSRIFNFFIIPLSFGLMVLGPNAILLYGSEKYIGGGIL 
TSLFAFRTIILALDTILGSQILFTNGYEKRITVYTVFAGLLNLGLNSLLFFNHIVAPEYYLLTTMLSETSLLVFYIIFIHRKQL 

25 IHLGHIFSYTVRYSLFSLSFVAIYFLINFVYPVDMVINLPFLINTGLIVLLSAISYISLLVFTKDSIFYEFLNHVLALKNKFKK 
SZ 



MELFMKITNYEIYKLKKSGLTNQQILKVLEYGENVDQELLLGDIADISGCRNPAVFMERYFQIDDAHLSKEFQKFPSFSIL 
30 DDCYPWDLSEIYDAPVLLFYKGNLDLLKFPKVAVVGSRACSKQGAKSVEKVIQGLENELVIVSGLAKGIDTAAHMAAL 
QNGGKTIAVIGTGLDVFYPKANKRLQDYIGNDHLVLSEYGPGEQPLKFHFPARNR1IAGLCRGVIVAEAKMRSGSLITCE 
RAMEEGRDVFAIPGSILDGLSDGCHHLIQEGAKLVTSGQDVLAEFEFZ 



35 MKQLTVEDAKQIELEILDYIDTLCKKHNINYIINYGTLIGAVRHEGFIPWDDDIDLSMPREDYQRFINIFQKEKSKYKLLS 
LETDKNYFNNFIKITDSTTKIIDTRNTKTYESGIFIDIFPIDRFDDPKVIDTCYKLESFKLLSFSKHKNIVYKDSLLKDWIRT 
AFWLLLRPVSPRYFANKIEKEIQKYSRENGQYMAFIPSKFKEKEVFPSGTFDKTIDLPFENLSLPAPEKFDTILTQFYGDY 
MTLPPEEKRFYSHEFHAYKLEDZ 

40 

MIKINHLTITQNKDLRDLVSDLTMTIQDGEKVAIIGEEGNGKSTLLKILMGEALSDFTIKGNIQSDYQSLAYIPQKVPEDL 
KKKTLHDYFFLDSIDLDYSILYRLAEELHFDSNRFASDQEIGNLSGGEALKIQLIHELAKPFEILFLDEPSNDLDLETVDW 
LKGQIQKTRQTVIFISHDEDFLSETADTIVHLRLVKHRKEAETLVEHLDYDSYSEQRKANFAKQSQQAANNQRAYDKT 
MEKHRRVKQNVETALRATKDSTAGRLLAKKMKTVLSQEKRYEKAAQSMTQKPLEEEQIQLFFSDIQPLPASKVLVQLE 
45 KENLSIDDRVLVQKLQLTVRGQEKIGIIGPNGVGKSTLLAKLQRLLNDKREISLGFMPQDYHKKLQLDLSPIAYLSKTGE 
KEELQKIQSHLASLNFSYPEMQHQIRSLSGGQQGKLLLLDLVLRKPNFLLLDEPTRNFSPTSQPQIRKLFATYPGGLITVS 
HDRRFLKEVCSIIYRMTEHGLKLVNLEDLZ 



50 MKPKTFYNLLAEQNLPLSDQQKEQFERYFELLVEWNEKINLTAITDKEEVYLKHFYDSIAPILQGLIPNETIKLLDIGAGA 
GFPSLPMKILYPELDVTIIDSLNKRINFLQLLAQELDLNGVHFYHGRAEDFAQDKNFRAQYDFVTARAVARMQVLSELT 
IPYLKVGGKLLALKASNAPEELLEAKNALNLLFSKVEDNLSYALPNRDPRYITVVEKKKETPNKYPRKAGMPNKRPLZ 



55 MSIKLJAVDIDGTLVNSQKEITPEVFSAIQDAKEAGVKVVIATGRPIAGVAKLLDDLQLRDEGDYVVTFNGALVQETATG 
HEIISESLTYEDYLDMEFLSRKLGVHMHAITKDGIYTANRNIGKYTVHESTLVSMPIFYRTPEEMAGKEIVKCMFIDEPEI 
LDAAIEKIPAEFYERYSINKSAPFYLELLKKNVDKGSAITHLAEKLGLTKDETMAIGDEENDRAMLEVVGNPVVMENGN 
PEIKKIAKYITKTNDESGVAHAIRTWVLZ 

60 

MTWIILGVIALIVIFVIVSYNGLVKNRMQTKEAWSQIDVQLKRRNDLLPNLIETVKGYAKYEGSTLEKVAELRNQVAAA 
TSPAEAMKASDALTRQVSGIFAVAESYPDLKASANFVKLQEELTNTENKISYSRQLYNSVVSNYNVKLETFPSNIIAGMF 
GFKAADFLQTPEEEKSVPKVDFSGLGDZ 

65 
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MLFDQIASNKRKTWILLLVFFLLLALVGYAVGYLFIRSGLGGLVIALIIGFIYALSMIFQSTEIVMSMNGAREVDEQTAPD 
LYHVVEDMALVAQIPMPRVFIIDDPALNAFATGSNPQNAAVAATSGLLAIMNRJEELEAVMGHEVSHIRNYDIRISTIAV 
ALASAITMLSSMAGRMMWWGGAGRRRSDDDRDGNGLEIIMLVVSLLAIVLAPLAATLVQLAISRQREFLADASSVELT 
RNPQGMINALDKLDNSKPMSRHVDDASSALYINDPKKGGGFQKLFYTHPPISERIERLKQMZ 

MKLNIQEIRKQSEGLNFEQTLDLVDDLRARNQEILDVKDILAVGKVQYEDRMYFLDYQLSYTIVLASSRSMEPVELVES 
YPVTEVFMEGATNQLDQEVLDDDLVLPIENGELDLAESVSDNILLNIPIKVLTAEEEAGQGFISGNDWQIMTEEEYQAQ 
KAVKKEENSPFAGLQGLFDGDEZ 

MKRQLALVVFSGGQDSTTCLFWVMQHYETVEAVTFAYGQRHHLEIQITREIAKEQGIRHHILDMSLLGQITAQPDFATI 
HISYIPDKLCVESKSLKLYLFSYRNHGDFHENCINTIGKDLVNLLDPRYLEVWGKFTPRGGISIDPYYNYGKQGTKYEGL 
AEQRLFQHDLYPEKIDNRZ 

MTETVEDKVSHSITGLDILKGIVAAGAVISGTVATQTKVFTNESAVLEKTVEKTDALATNDTVVLGTISTSNSASSTSLSA 
SESASTSASESASTSASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDYVASVTNVNLQSYA 
KRRKRSVDSIEQLLASIKNAAVFSGNTIVNGAPAINASLNIAKSETKVYTGEGVDSVYRVPIYYKLKVTNDGSKLTFTYT 

20 VTYVNPKTNDLGNISSMRPGYSIYNSGTSTQTMLTLGSDLGKPSGVKNYITDKNGRQVLSYNTSTMTTQGSGYTWGNG 
AQMNGFFAKKGYGLTSSWTVPITGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSLSVSASQSASASAS 
TSASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASE 
SASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTST 
SASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTS 

25 ASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASA 
STSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASA 
SASTSASASARQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQHQPVPRLQRQPVRQLQQVPVLQSQHQQVLQ 
PQHRQVPRLQQAHQHLNQRRQAPQLQQVPVRQPQRRQVRQPQQVLVHQLQHQRVHRLRRQPVHQSQQVPVRQLPHQ 
QVPRLQQAPVRRLQQVLAPQPQPQPVRQPQQVSQRLNRHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLNRHQRVRPL 

30 QQVLAPQPQRQQVHRLQHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLSQHQRVRQPQQAHQLLNLHQPVRQPQHRQ 
APQLQQVPVRQPQRRQVRRLQQVPVRQPQQVPVRQPQRRQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQH 
QPVHQSQQVPVRQFRINKCLGFSKYZ 

35 MGVETWFYSSICWLAIGLGSVWKFPYMTAANGGGGFLLIFLISTILIGFPLLLAEFALGRSAGVSAIKTFGKLGKNNKYN 
FIGWIGAFALFILLSFYSVIGGWILVYLGIEFGKLFQLGGTGDYAQLFTSIISNPAIALGAQAAFILLNIFIVSRGVQKGIERA 
SKVMMPLLFIVFVFIIGRSLSLPNAMEGVLYFLKPDFSKLTSTGLLYALGQSFFALSLGVTVMLTYASYLDKKTNLVQSG 
ISIVAMNISISIMAGLAIFQARSPFNIQSEGGPSLLFIVLPQLFDKMPFGTIFYVLFLLLFLFATVTFSVVMLEINVDNITNQD 
NSKRAKWSVILGILTFVFGIPSALSYGVMADVHIFGKTFFDAMDFLVSNLLMPFGALYLSLFTGYIFKKALAMEELHLD 

40 ERAWKQGLFQVWLFLLRFFVSSFQSSSLWSSLPNLCNQKGLEZ 

MLKKWQLKDVILLAFLSIFFGGVFVGSGYVYNILSLLLTPLGLQAFANEILFGLWCMAAPIAAIFVPRVGSATIGEVLAA 
LAEVLYGSQFGLGALLSGFVQGLGSEFGFIVTKNRYESWLSLTANSIGITLVSFVYEYIKLGYYAFSLPFVLSLLVVRFISV 
45 YFFCTILVRAIVKLYHQFATGGKAZ 

MVKVATQTPIISLFLLILSLETSFIPSIALTLSVVAFCILFMLYYRRFKMLAWMIILAILPSFANYWAVQLHGDASQAVML 
GTRAFVTVCIGLVFVSSVSLKELLLYLAQKGLSRSWSYALIVVFNSFPLIQQEIKSLKEACLLRGQELHFWSPLIYSKVLM 
50 TVFRWRHLYLRALSAHGYDEHAQLKNSYRTFYIPKKTKLIYLLFFLLLQTSLFLZ 

MRKHQLQVHKLTILSMMIALDVVLTPIFRIEGMAPMSSVVNILAGIMMGPVYALAMATVTAFIRMTTQGIPPLALTGAT 
FGALLAGLFYKYGRKFHYSALGEILGTGIIGSIVSYPVMVLFTGSAAKLSWFIYTPRFFGATLIGTAISFIAFRFLIKQEFFK 
55 KVQGYFFSERIDZ 

MQEFTNPFPIGSSSLIHCITNEISCEMLANGILALGCKPVMADDSREVLDFTKQSQALFINLGHLSAEKEKAIRMAASYAN 
QSSLPMVVDAVGVTTSSIRKSLVKDLLDYRPTVLKGNMSEIRSLVGLKHHGVGVDASAKDQETEDLLQVLKDWCQTYP 
60 GMSFLVTGPKDLVVSKNQVAVLGNGCTELDWITGTGDLVGALTAVFLSQGKTGFEASCLAVSYLNIAAEKIVVQGMG 
LEEFRYQVLNQLSLLRRDENWLDTIKGEVYEZ 

MNHKIAILSDVHGNATALEAVIADAKNQGASEYWLLGDIFLPGPGANDLVALLKDLPITASVRGNWDDRVLEALDGQ 
65 YGLEDPQEVQLLRMTQYLMERMDPATIVWLRSLPLLEKKEIDGLRFSISHNLPDKNYGGDLLVENDTEKFDQLLDAET 
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DVAVYGHVHKQLLRYGSQGQQIINPGSIGMPYFNWEALKNHRSQYAVIEVEDGELLNIQFRKVAYDYEAELELAKSKG 
LPFIEMYEELRRDDNYQGHNLELLASLIEKHGYVEDVKNFFDFLZ 

5 MNVNQIVRIIPTLKANNRKLNETFYIETLGMKALLEESAFLSLGDQTGLEKLVLEEAPSMRTRKVEGRKXLARLIVKVE 
NPLEIEGILSKTDSIHRLYKGQNGYAFEIFSPEDDLILIHAEDDIASLVEVGEKPEFQTDLASISLSKFEISMELHLPTDIESF 
LESSEIGASLDFIPAQGQDLTVDNTVTWDLSMLKFLVNELDIASLRQKFESTEYFIPKSEKFFLGKDRNNVELWFEEVZ 

10 MKWTKIIKKIEEQIEAGIYPGASFAYFKDNQWTEFYLGQSDPEHGLQTEAGLVYDLASVSKVVGVGTVCTFLWEIGQLD 
IDRLVIDFLPESDYPDITIRQLLTHATDLDPFIPNRDLLTAPELKEAMFHLNRRSQPAFLYSDVHFLLLGFILERIFNQDLD 
VILKDQVWKPWGMTETKFGPVELAVPTVRGVEAGIVHDPKARLLGRHAGSAGLFSTIKDLQIFLEHYLADDFARDLNQ 
NFSPLDDKERSLAWNLEGDWLDHTGYTGTFIMWNRQKQEATIFLSNRTYEKDERAQWILDRNQVMNLIRKEEZ 

15 

MMKKTYNHILVWGVIFYSICIVCFCFTPQEQSTVGVGTPGIQHLGRLVFLLTPFNSLWKLGEVSDIGQLCWIFLQNILNV 
FLFFPLIFQLLYLFPNLRKTKKVLLFSFLVSLGIECTQLILDFFFDFNRVFEIDDLWTNTLGGYLAWLLYKRLHKNKVRN 
Z 

20 

MKIPLLTFARHKFVYVLLTLLFLALVYRDVLMTYFFFDIHAPDLAKFDGQAIKNDLLKSALDFRILQFNLGFYQSFIIPIII 
VLLGFQYIELKNKVLRLSIGREVSYQGLKRKLTLQVASIPCLIYLVTVLIIAIITYFFGTFSPLGWNSLFSDGSGLQRLLDGE 
IKSYLFFTCVLLIGIFINAIYFLQIVDYVGNVTRSAITYLMFLWLGSMLLYSALPYYMVPMTSLMQASYGDVSLMKLFTP 
YILYrVPYMVLEKYEDNVZ 

25 

MFKVLQKVGKAFMLPIAILPAAGLLLGIGGALSNPTTIATYPILDNSIFQSIFQVMSSAGEVVFSNLSLLLCVGLCIGLAKR 
DKGTAALAGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALVVGIVAVYLHNRYNNIQLPSALGFFGGSRFVPIVTSF 
SSILIGFVFFVIWPPFQQLLVSTGGYISQAGPIGTFLYGFLMRLSGAVGLHHIIYPMFWYTELGGVETVAGQTVVGAQKIF 
30 FAQLADLAHSGLFTEGTRFFAGRFSTMMFGLPAACLAMYHSVPKNRRKKYAGLFFGVALTSFITGITEPIEFMFLFVSPV 
LYVVHAFLDGVSFFIADVLNISIGNTFSGGVIDFTLFGILQGNAKTNWVLQIPFGLIWSVLYYIIFRWFITQFNVLTPGRGE 
EVDSKEISESADSTSNTADYLKQDSLQIIRALGGSNNIEDVDACVTRLRVAVKEVNQVDKALLKQIGAVDVLEVKGGIQ 
AIYGAKAILYKNSINEILGVDDZ 

35 MKFRKLACTVLAGAAVLGLAACGNSGGSKDAAKSGGDGAKTEITWWAFPVFTQEKTGDGVGTYEKSIIEAFEICANPDI 
KVKLETIDFKSGPEKITTAIEAGTAPDVLFDAPGRIIQYGKNGKLAELNDLFTDEFVKDVNNENIVQASKAGDKAYMYPI 
SSAPFYMAMNKKMLEDAGVANLVKEGWTTDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGSVTDEKV 
SKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKLLEASKVEVVEVPFPSD 
EGKPALEYLVNGFAVFNNKDDKKVAASKKFIQFIADDKEWGPKDVVRTGAFPVRTSFGKLYEDKRMETISGWTQYYSP 

40 YYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEKANETIKKAMKQZ 

MQSTEKKPLTAFTVISTIILLLLTVLFIFPFYWILTGAFKSQPDTIVIPPQWFPKMPTMENFQQLMVQNPALQWMWNSVFI 
SLVTMFLVCATSSLAGYVLAKKRFYGQRILFAIFIAAMALPKQVVLVPLVRIVNFMGIHDTLWAVILPLIGWPFGVFLM 
45 KQFSENIPTELLESAKIDGCGEIRTFWSVAFPIVKPGFAALAIFTFINTWNDYFMQLVMLTSRNNLTISLGVATMQAEMA 
TNYGLIMAGAALAAVPIVTVFLVFQKSFTQGITMGAVKGZ 

MKIMFKNFNNILLNRKIVLLLRIVLMMILINHLLSTAVQKQDAVIFFKRELISIFSYNDYSEANLEIPKLLLNLSLFMVGW 
50 LSVILLESDLADHYHHLIRYQSSSFFDYTRKRLVVISKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMMLYLLLSYLI 
ALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVMLIVSRLEEKFKKGZ 

MGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSKTLLFAVDKEDIGQEIIGIAKGSIESMHNLPVNLAG 
55 ARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKKDYTYKAPLAQQALPETGNKESDLLASLG 
LTAFFLGLFTLGKXREQZ 

60 MKKTFFLLVLGLFCLLPLSVFAIDFKINSYQGDLYIHADNTAEFRQKIVYQFEEDFKGQIVGLGRAGKMPSGFDIDPHPKI 
QAAKNGAELADVTSEVTEEADGYTVRVYNPGQEGDIVEVDLVWNLKNLLFLYDDIAELNWQPLTDSSESIEKFEFHVR 
GDKGAEKLFFHTGKLFREGTIEKSNLDYTIRLDNLPAKRGVELHAYWPRTDFASARDQGLKGNRLEEFNKIEDSIVREK 
DQSKQLVTWVLPSILSISLLLSVCFYFIYRRKTTPSVKYAKNHRLYEPPMELEPMVLSEAVYSTSLEEVSPLVKGAGKFTF 
DQLIQATLLDVIDRGNVSIISEGDAVGLRLVKEDGLSSFEKDCLNLAFSGKKEETLSNLFADYKVSDSLYRRAKVSDEKR 

65 IQARGLQLKSSFEEVLNQMQEGVRKRVSFWGLPDYYRPLTGGEKALQVGMGALTILPLFIGFGLFLYSLDVHGYLYLPL 
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PILGFLGLVLSVFYYWKLRLDNRDGVLNEAGAEVYYLWTSFENMLREIARLDQAELESIVVWNRLLVYATLFGYADK 
VSHLMKVHQIQVENPDINLYVAYGWHSTFYHSTAQMSHYASVANTASTYSVSSGSGSSGGGFSGGGGGGSIGAFZ 



5 MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAIGKVVIKETGEGGALLGDAVFELKNNTDGTTVSQRTEAQTG 
EAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEALSDQYPQTGTYPDVQTPYQIIKVDGS 
EKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNA 
RRAERAGEATRSLIDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSFTTNTKDYSYLKL 
TNDKNDIVELKNKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVIFHITDGVPTMSYPINFNHAT 

10 FAPSYQNQLNAFFSKSPNKDGILLSDFITQATSGEHTIVRGDGQSYQMFTDKTVYEKGAPAAFPVKPEKYSEMKAAGYA 
VIGDPINGGYIWLNWRESILAYPFNSNTAKITNHGDPTRWYYNGNIAPDGYDVFTVGIGINGDPGTDEATATSFMQSISS 
KPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELIDLQLGTDGRFDPADYTLTANDGSRLENGQAVGGP 
QNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVTLTYNVRLNDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKI 
RDVRKYPEITISKEKKLGDIEFIKVNKNDKKPLRGAVFSLQKQHPDYPDIYGAIDQNGTYQNVRTGEDGKLTFKNLSDG 

15 KYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIVPQDIPAGYEFTNDKHYITNEPIPPKREYPRTGGIGMLPFYLIG 
CMMMGGVLLYTRKHPZ 



MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKEIAG 
20 VMFVWTNTNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKKLTEAEGAKFNTANLPAAKYKIYEIHSLSTYVGEDGA 
TLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIVTKIPALANYA 
TANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQNAEKTVKITYSATLNDKAI 
VEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTGKVVQTVTLTTDKN 
TVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKKFVKVNDKDNRLAGAEF 
25 VIANADNAGQYLARKADKVSQEEKQLVVTTKDALDRAVAAYNALTAQQQTQQEKEKVDKAQAAYNAAVIAANNAF 
EWVADKDNENVVKLVSDAQGRFEITGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSYSATGQGIEYTAGSGKDDAT 
KVVNKKITIPQTGGIGTIIFAVAGAAIMGIAVYAYVKNNKDEDQLAZ 



30 MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QIVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAKKTDTM 
TTKVKLIKVDQDHNRLEGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDKNGEIFVTNLPLGNYRFKEVEPL 
AGYAVTTLDTDVQLVDHQLVTITVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVLQNGKEVVVTS 
GKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGEETLVYLDACCHFV 

35 VWZ 



MSHIYLSIFTSLLLxMLGLVNVAQADEYLRlGMEAAYAPFNWTQDDDSNGAVKIDGTNQYANGYDVQIAKKIAKDLGKE 
PLVVKTKWEGLVPALTSGKIDMIIAGMSPTAERKQEIAFSSSYYTSEPVLLVKKDSAYASAKSLDDFNGAKITSQQGVYL 

40 YNLIAQIPGAKKETAMGDFAQMRQALEAGVIDAYVSERPEALTAEAANSKFKMIQVEPGFKTGEEDTAIAIGLRKNDNR 
ISQINASIETISKDDQVALMDRMIKEQPAEATTTEETSSSFFSQVAKILSENWQQLLRGAGITLLISrVGTIIGLIIGLAIGVFR 
TAPLSENKVIYGLQKLVGWVLNVYIEIFRGTPMIVQSMVIYYGTAQAFGINLDRTLAAIFIVSINTGAYMTEIVRGGILAV 
DKGQFEAATALGMTHNQTMRKIVLPQVVRNILPATGNEFVINIKDTSVLNVISVVELYFSGNTVATQTYQYFQTFTIIAV 

^ IYFVLTFTVTRILRFIERRMDMDTYTTGANQMQTEDLKZ 

MTQAILEIKHLKKSYGQNEVLKDISLTVHKGEVISIIGSSGSGKSTFLRSINLLETPTDGQILYHGQNVLEKGYDLTQYREK 
LGMVFQSFNLFENLNVLENTIVAQTTVLKRERTEAEKIAKENLEKVGMGERYWQAKPKQLSGGQKQRVAIARALSMN 
PDAILFDEPTSALDPEMVGEVLKIMQDLAQEGLTMIVVTHEMEFARDVSHRVIFMDKGVIAEEGKPEDLFTNPKEDRTIC 
50 EFLQRYLKZ 



MKKYQLLFKISAVFSYLFFVFSLSQLTLIVQNYWQFSSQIGNLFWIQNILSLLFIGVMIVVLVKTGHGYLFRIPRKKWLW 
YSILTVLVLVFQISFNVQTAKHVQSTAEGWAVLIGYSGTNFAELGIYIALFFLVPLMEELIYRGLLQHAFFKHSRFGLDLL 
55 LPSILFALPHFSSLPSLLDIFVFATVGIIFAGLTRYTKSIYPSYAVHVINNIVATFPFLLTFLHRVLGZ 



MNKKQWLGLGLVAVAAVGLAACGNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDN 
GFTYFQSTSEADYANNLQQAAGSYNLIFGVGFALNNAVKDAAKEHTDLNYVLIDDVIKDQKNVASVTFADNESGYLA 
60 GVAAAKTTKTKQVGFVGGIESEVISRFEAGFKAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVAG 
GTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGKYTSKDGKESNFVLVSTLKQVGTTVKDISNKAERGEFPGGQV 
IVYSLKDBCGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEKZ 
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MSKKLQQISVPLISVFLGILLGAIVMWIFGYDAIWGYEELFYTAFGSLRGIGEIFRAMGPLVLIGLGFAVASRAGFFNVGL 
PGQALAGWII^GWFAI^HPDMPRPLMILATIVIALIAGGIVGAIPGILRAYLGTSEVIVTIMMNYIVLYVGNAFIHAFPKD 
FMQSTDSTIRVGANATYQTPWLAELTGNSRMNIGIFFAIIAVAVIWFMLKKTTLGFEIRAVGLNPHASEYAGISAKRTIIL 
SMIISGALAGLGGAVEGLGTFQNVYVQGSSLAIGFNGMAVSLLAANSPIGILFAAFLFGVLQVGAPGMNAAQVPSELVSI 
5 VTASIIFFVSVHYLIERFVKPKBCQVKGGKZ 



MGVKKKLKLTSLLGLSLLIMTACATNGVTSDITAESADFWSKLVYFFAEIIRFLSFDISIGVGIILFTVLIRTVLLPVFQVQ 
MVASRKMQEAQPRIKALREQYPGRDMESRTKLEQEMRKVFKEMGVRQSDSLWPILIQMPVILALFQALSRVDFLFCTGH 
10 FLWINLGSVDTTLVLPILAAVFTFLSTWLSNKALSERNGATTAMMYGIPVUFIFAVYAPGGVALYWTVSNAYQVLQTY 
FLNNPFKIIAEREAVVQAQKDLENRKRKAKKKAQKTKZ 



MVIDPFAINELDYYLVSHFHSDHIDPYTAAAILNNPKLEHVKFIGPYHCGRIWEGWGVPKERIIVVKPGDTIELKDMKIH 
15 AVESFDRTCLVTLPVNGADETGGELAGLAVTDEEMAQKAVNYIFETPGGTIYHGADSHFSNYFAKHGKDFKIDVALNN 
YGENPVGIQDKMTSIDLLRMAENLRTKVIIPVHYDIWSNFMASTNEILELWKMRKDRLQYDFHPFIWEVGGKYTYPQD 
QHLVEYHHPRGFDDCFEQDSNIQFKALLZ 



20 MFLSGWLSSFANTYIHDLLGVLFPDSPFLNAFESAIAAPLVEEPLKLLSLVFVLALIPVRKLKSLFLLGIASGLGFQMIKDI 
GYIRTDLPEGFDFTISRILERIISGIASHWTFSGLAVVGVYLLYRAYKGQKVGKKQGLIFLGLALGTHFLFNSPFVELETEL 
PLAIPVVTAIALYGFYHAYCFVEKHNELMTZ 



25 MKVEPRCDVLSRMSHFFIRILIMELQELVERSWAIRQAYHELEVKHHDSKWTVEEDLLALSNDIGNFQRLVMTKQGRY 
YDETPYTLEQKLSENIWWLLELSQRLDIDILTEMENFLSDKEKQLNVRTWKZ 



MLDWKQFFLAYLRSRSRLFIYLLSLAFLVLLFQFLFASLGIYFLYFFFLCCFVTILFFTWDILVETQVYRQELLYGEREAK 
30 SPLEIALAEKLEAREMELYQQRSKAERKLTDLLDYYTLWVHQIKTPIAASQLLVAEVVDRQLKQQLEQEIFKIDSYTNLV 
LQYLRLESFHDDLVLKQVQIEDLVKEIIRKYALFFIQKGLNVNLHDLDKEIVTDKKWLLVVIEQIISNSLKYTKEGGLEIY 
MDDQELCIKDTGIGIKNSDVLRVFERGFSGYNGRLTQQSSGLGLYLSKKISEELGHQIRIESEVGKGTTVRIQFAQVNLVL 
EZ 

35 

MELNTHNAEILLSAANKSHYPQDELPEIALAGRSNVGKSSFINTMLNRKNLARTSGKPGKTQLLNFFNIDDKMRFVDVP 
GYGYARVSKKEREKWGCMIEEYLTTRENLRAVVSLVDLRHDPSADDVQMYEFLKYYEIPVIIVATKADKIPRGKWNKH 
ESAIKKKLNFDPSDDFILFSSVSKAGMDEAWDAILEKLZ 

40 

MTKKQLHLVIVTGMSGAGKTVAIQSFEDLGYFTIDNMPPALLPKFLQLVEIKEDNPKLALVVDMRSRSFFSEIQAVLDEL 
ENQDGLDFKILFLDAADKELVARYKETRRSHPLAADGRILDGIKLERELLAPLKNMSQNVVDTTELTPRELRKTLAEQF 
SDQEQAQSFRIEVMSFGFKYGIPIDADLVFDVRFLPNPYYLPELRNQTGVDEPVYDYVMNHPESEDFYQHLLALIEPILP 
SYQKEGKSVLTIAMGCTGGQHRSVAFAKRLAQDLSKNWSVNEGHRDKDRRKETVNRSZ 

45 

MRKPKITVIGGGTGSPVILKSLREKDVEIAAIVTVADDGGSSGELRKNMQQLTPPGDLRNVLVAMSDMPKFYEKVFQYR 
FSEDAGAFAGHPLGNLIIAGLSEMQGSTYNAMQLLSKFFHTTGKIYPSSDHPLTLHAVFQDGTEVAGESHIVDHRGIIDN 
VYVTNALNDDTPLASRRVVQTILESDMIVLGPGSLFTSILPNIVIKEIGRALLETKAEIAYVCNIMTQRGETEHFTDSDHV 
50 EVLHRHLGRPFIDTVLVNIEKVPQEYMNSNRFDEYLVQVEHDFVGLCKQVSRVISSNFLRLENGGAFHDGDLIVDELMR 
I 

IQVKKZ 



55 MKNLIKLLIIRLIVNLADSVFYIVALWHVSNNYSSSMFLGIFIAVNYLPDLLLIFFGPVIDRVNPQKILIISILVQLAVAVIFL 
LLLNQISFWVIMSLVFISVMASSISYVIEDVLIPQVVEYDKIVFANSLFSISYKVLDSIFNSFASFLQVAVGFILLVKIDIGIFL 
LALFILLLLKFRTSNANIENFSFKYYKREVLQGTKFILNNKLLFKTSISLTLINFFYSFQTVVVPIFSIRYFDGPIFYGIFLTIA 
GLGGILGNMLAPIVIKYLKSNQIVGVFLFLNGSSWLVAIVIKDYTLSLILFFVCFMSKGVFNIIFNSLYQQIPPHQLLGRVN 
TTIDSIISFGMPIGSLVAGTLIDLNIELVLIAISIPYFLFSYIFYTDNGLKEFSIYZ 

60 

MMSNKNKEILIFAILYTVLFMFDGVKLLASLMPSAIANYLVYVVLALYGSFLFKDRLIQQWKEIRKTKRKFFFGVLTGW 
LFLILMTVVFEFVSEMLKQFVGLDGQGLNQSNIQSTFQEQPLLIAVFACVIGPLVEELFFRQVLLHYLQERLSGLLSHLV 
GLVFALTHMHSLALSEWIGAVGYLGGGLAFSIIYVKEKENIYYPLLVHMLSNSLSLIILAISIVKZ 
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LKKPIIEFKNVSKVFEDSNTKVLKDINFELEEGKFYTLLGASGSGKSTILNIIAGLLDATTGDIMLDGVRINDIPTNKRDVH 
TVFQSYALFPHMNVFENVAFPLRLRKIDKKEIEQRVAEVLKMVQLEGYEKRSIRKLSGGQRQRVAIARAIINQPRVVLLD 
EPLSALDLKLRTDMQYELRELQQRLGITFVFVTHDQEEALAMSDWIFVMNDGEIVQSGTPVDIYDEPINHFVATFIGESN 
5 ILPGTMIEDYLVEFNGKRFEAVDGGMKPNEPVEVVIRPEDLR1TLPEEGKLQVKVDTQLFRGVHYEIIAYDELGNEWMI 
HSTRKAIVGEEIGLDFEPEDIHIMRLNETEEEFDARIEEYVEIEEQEAGLINAIEEERDEENKLZ 



MKSMRILFLLALIQISLSSCFLWKECILSFKQSTAFFIGSMVFVSGICAGVNYLYTRKQEVHSVLASKKSVKLFYSMLLLIN 
10 LLGAVLVLSDNLFIKNTLQQELVDFLLPSFFFLFGLDLLIFLPLKKYVRDFLAMLDRKKTVLVTILATLLFLRNPMTIVSL 
LIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDLVLVIVTLIFFZ 



MVICKIIGMVLALLSVTVVGVGVFAYTIYQQGTETLAKTYKKIGEETKVIEATEPLTILLMGVDTGNVERTETWVGRSDS 
15 MILMTVNPKTKKTTMMSLERDILTRIESGNGQAHEAKLNSAYADGGAELAIETIQKMMNIHIDRYVMVNMRGLQKLV 
DAVGGITVNNILGFPISISDQEEFNTISIGVGEQHIGGEEALVYARMRYQDPEGDYGRQKRQREVIQKVMEKALSLNSIGH 
YQEILKALSDNMQTNIDLSAKSIPNLLGYKDSFKTIETQQLQGEGEILQGVSYQIVSRAHMLEMQNLLRRSLGQEEVTQL 
ETNAVLFEDLFGRAPVGDEDNZ 

20 

MKKQAYVIIALTSFLFVFFFSHSLLEILDFDWSIFLHDVEKTEKFVFLLLVFSMSMTCLLALFWRGIEELSLRKMQANLK 
RLLAGQEVVQVADPDLDASFKSLSGKLNLLTEALQKAENQSLAQEEEIIEKERKRIARDLHDTVSQELFAAHMILSGISQ 
QALKLDREKMQTQLQSVTAILETAQKDLRVLLLHLRPVELEQKSLIEGIQILLKELEDKSDLRVSLKQNMTKLPKJCIEEHI 
FRILQELISNTLRHAQASCLDVYLYQTDVELQLKVVDNGIGFQLGSLDDLSYGLRNIKERVEDMAGTVQLLTAPKQGLA 
25 VDIRIPLLDKEZ 



MIVSIISQGFVWAILGLGIFMTFRILNFPDMTTEGSFPLGGAVAVTLITKGVNPFLATLVAVGAGCLAGMAAGLLYTKGK 
IPTLLSGILVMTSCHSIMLLIMGRANLGLLGTKQIQDVLPFDSDLNQLLTGLIFVSIVIALMLFFLDTKLGQAYIATGDNP 
30 DMARSFGIHTGRMELMGLVLSNGVIALAGALIAQQEGYADVSRGIGVIVVGLASLIIGEVIFKSLSLAERLVTIVVGSIAY 
QFLVWAVIALGFNTSYLRLYSALILAVCLMIPTFKQTILKGAKLSKZ 



MKKMKVWSTVLATGVALTTLAACSGGSNSTTASSSEEKADKSQELVIYSNSVSNGRGDWLTAKAKEAGFNIKMVDIAG 
35 AQLADRVIAEKNNAVADMVFGIGAVDSNKIRDQKLLVQYKPKWLDKIDQSLSDKDNYYNPVIVQPLVLIGAPDVKEMP 
KDWTELGSKYKGKYSISGLQGGTGRAILASILVRYLDDKGELGVSEKGWEVAKEYLKNAYTLQKGESSIVKMLDKEDPI 
QYGMMWGSGALVGQKEQNVVFKVMTPEIGVPFVTEQTMVLSTSKKQALAKEFIDWFGQSEIQVEYSKNFGSIPANKD 
ALKDLPEDTKKFVDQVKPQNIDWEAVGKHLDEWVEKAELEYVQZ 

40 

MIKFDNIQIKYGDFVAIDNLNLDIHEGEFFTFLGPSGCGKSTTLRALVGFLDPSSGSIEVNGTDVTHLEPEKRGIGIVFQSY 
ALFPTMTVFDNIAFGLKVKKVAPDVIKAKVSAVAAKIKISDQQLQRNVSELSGGQQQRVALARALVLEPKILCLDEPLS 
NLDAKLRVDLRKELKRLQKELGITTLYVTHDQEEALTLSDR1AVFNNGYIEQVGTPVEIYHNSQTEFVCDFIGDINVLTD 
ETVHEVLLKNTSVFLEDKKGYIRLEKVRFNRETEQDFILKGTIIDVEFSGVTIHYTIKVSESQILNVTSIDSQAAIRSVGESV 
45 ELFITPSDVLQFZ 



MRHKLNLKDWLIRLGLIWFLVTFIIYPNFDLVVNVFVKGGEFSLDAVHRVLKSQRALQSIMNSFKLAFSLIITVNVVGIL 
CVLFTEYFDIKGAKILKLGYMTSLIYGGVVLATGYKFVYGPYGLITKFLQNVIPSLDPNWFIGYGAVLFIMTFSGTANHT 
50 LFLTNTIRSVDYHTIEAARNMGAKPFTVFRKVVLPTLIPTLFALTIMVFLSGLSAVAAPMIVGGKEFQTINPMIITFAGMG 
NSRDLAALLAIILGIATTILLTIMNKIEKGGNYISISKTKAPLKKQKIASKPWNIIAHIVAYGLFTVFMLPLIFIVLYSFTDPV 
AIQTGNLTLSNFTLENYRLFFSNSAAFSPFLVSFIYSIIAATTATILAVVFARVVRKHKSRFDFLFEYGALLPWLLPSTLLA 
VSLLFTFNQPQFLVLNQILVGSLVILLIAYIVVKIPFSYRMVRAILFSVDDEMEDAARSMGASPFYTMMKVIIPFILPVVLS 
VIALNFNSLLTDFDLSVFLYHPLAQPLGITIRSAGDETATSNAQALVFVYTIVLMIISGTVLYFTQRPGRKVRKZ 
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Table 3 

ID201 = 4106.4 

ATGATAAAAAATCCTAAATTATTAACCAAGTCTTTTTTAAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCAT 
TCATATAGCTATTTATTTGACCTTTCCTTTTTATTATATTCAACTGGAGGGGGAAAAGTTTAATGAGAGCGCAAGAG 
TGTTTACGGAGTATTTAAAGACTAAGACATCTGATGAAATTCCAAGCTTACTCCAGTCTTATTCAAAGTCCTTGACC 
ATATCTGCTCACCTTAAAAGAGATATTGTAGATAAGCGGCTCCCTCTTGTGCATGACTTGGATATTAAAGATGGAAA 
GCTATCAAATTATATCGTGATGTTAGATATGTCTGTTAGTACAGCAGATGGTAAACAGGTAACCGTGCAATTTGTTC 
ACGGGGTGGATGTCTACAAAGAAGCAAAGAATATTTTGCTTTTGTATCTCCCATATACATTTTTGGTTACAATTGCT 
TTTTCCTTTGTTTTTTCTTATTTTTATACTAAACGCTTGCTCAATCCTCTTTTTTACATTTCAGAAGTGACTAGTAA 
AATGCAAGATTTGGATGACAATATTCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTGGAAAACAGATTA 
ATGGTATGTATGAGCACTTGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAAAATTGCAAAAT 
CAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCTTTAGCCAGTCTTAGAATTATCCTAGAGAA 
T ATGC AGC AT AAT ATTGGAG AT T AC AAAG AT CAT C CAAAAT AT AT TGC AAAGAGT AT AAAT AAG AT TG AC C AGATGA 
GCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGACCTTGACTGTTAAG 
CCAGTTTTAGTAGATATTTTATCACGTTATCAAGAATTAGCTCATTCAATAGGTGTTACAATTGAAAATCAATTGAC 
AGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAGGTTTTGACAAACCTGATTAGTAATGCAATTAAAT 
ATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAAATACATGTGCGCCTCTA 
AGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGACAGATAAGGATGAAAGTTCCGG 
TTTGGGT CTT T AC AT T GTG AAT AAT ATTTT AG AAAG C T AT C AAATGG AT T AT AGTT TT C T C C C T T ATG AAC ACG GT A 
TGGAATTTAAGATTAGCTTGTAG 



MIKNPKLLTKSFLRS FAILGGVGLVIHIAI YLTFPFYYIQLEGEKFNESARVFTEYLKTKTSDEI PSLLQSYSKSLT 
ISAHLKRDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVST^ 

FSFVFSYFYTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQINGMYEHLLKVIYELESRNEQIVKLQN 
QKVSFVRGASHELKTPLASLRIILENMQHNIGDYKDHPKYIAKSINKIDQMSHLLEEVLESSKFQEWTECRETLTVK 
PVLVD I LSRYQELAHS I GVT I ENQLTDATRWMS LRALDKVLTNLI SNAI KYSDKNGRVI I S EQDGYLS I KNTCAPL 
SDQELEHLFDI FYHSQIVTDKDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ 

ID202 - 4106.9 

ATGGATAAAATTATTAAAACTATATCAGAAAGCGGAGCCTTTCGTGCTTTTGTCCTTGATAGCACTGAAACCGTCCG 
CACTGCTCAAGAAAAACATCAAACCCAAGCTAGCTCAACTGTAGCGCTTGGTCGAACTCTTATCGCTAGCCAGATTC 
TCGCAGCCAATGAAAAAGGAAATACCAAACTTACAGTTAAGGTGTTGGGATCTAGCTCTCTAGGTGCTATTATCACC 
GTCGCTGATACCAAGGGGAACGTCAAAGGCTATGTTCAAAATCCTGGTGTTGACATCAAAAAGACTGCGACTGGTGA 
AGTCCTAGTCGGACCTTTTGTTGGAAATGGTCAATTCCTCGTTATCACAGACTACGGTACTGGAAATCCTTACAACT 
CTATAACTCCCCTCATCTCTGGAGAAATCGGTGAAGACCTTGCCTTTTACCTTACTGAAAGCCAACAAACGCCTTCA 
GCGGTCGGCCTCAATGTCCTTTTGGACGAGGAAGACAAGGTCAAGGTTGCAGGTGGTTTCCTAGTTCAAGTCTTGCC 
AGGAGCCAAGAAAGAAGAGATTGCTCGCTTTGAAAAACG CAT CC AAGAAATGCCAGCTATCTCTACTCTTCTCGAAA 
GCGAC G AC CAT AT CG AAG C C CT C C T C AAGG CT ATC T ACGG GG AC G AAG C CT AC AAG CG T CT TT CTG AAG AAG AAAT C 
CGTTTCCAATGTGACTGTAGCCATGAACGCTTTATGAACGCTCTTGCCAGCCTTCCAAGCTCAGACTTACAGGAAAT 
GAAAGAGGAAGACCACGGGGCAGAAATCACTTGTCAATTCTGCCAAACTACTTACAACTTTGATGAAAAGGACCTGG 
AGGAACTCATTCGTGACAAATCTTAA 



MDKI IKTISESGAFRAFVLDSTETVRTAQEKHQTQASSTVALGRTLIASQILAANEKGNTKLTVKVLGSSSLGAI IT 
VADTKGNVKGYVQNPGVDIKKTATGEVLVGPFVGNGQFLVITDYGTGNPYNSITPLISGEIGEDLAFYLTESQQTPS 
AVGLNVLLDEEDKVKVAGGFLVQVLPGAKKEEIARFEKRIQEMPAISTLLESDDHIEALLKAI YGDEAYKRLSEEEI 
RFQCDCSHERFMNALASLPSSDLQEMKEEDHGAEITCQFCQTTYNFDEKDLEELIRDKSZ 



ID203 - 4115 

ATGAAATCAATAACTAAAAAGATTAAAGCAACTCTTGCAGGAGTAGCTGCCTTGTTTGCAGTATTTGCTCCATCATT 
T GT AT C T G C T C AAG AAT CAT C AACTT AC ACTGT T AAAG AAGGTG AT AC ACT T T C AG AAATC G C TGAAAC T C AC AAC A 
C AAC AG TTG AAAAAT T GG C AG AAAAC AAC C AC AT T GAT AAC ATT CAT T TG AT TT ATGT TG AT C AAG AGT TGGTT AT C 
GATGGCCCTGTAGCGCCTGTTGCAACACCAGCGCCAGCTACTTATGCGGCACCAGCCGCTCAAGATGAAACTGTTTC 
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AGCTCCAGTAGCAGAAACTCCAGTAGTAAGTGAAACAGTTGTTTCAACTGTAAGCGGATCTGAAGCAGAAGCCAAAG 
AATGGATCGCTCAAAAAGAATCAGGTGGTAGTATACAGCTACAAATGGACGTTATATCGGACGTTACCAATTAA 



MKSITKKIKATLAGVAALFAVFAPSFVSAQESSTYTV 

DGPVAPVATPAPATYAAPAAQDEWSAPVAETPWSETWSTVSGSEAEAKEWIAQKESGGSIQLQMDVISDVTNZ 
ID204 - 4117.1 

ATGAATTTAGGAGAATTTTGGTACAATAAAATAAATAAGAACAGAGGAAGAAGGTTAATGAAGAAAGTAAGATTTAT 
TTTTTTAGCTCTGCTATTTTTCTTAGCTAGTCCAGAGGGTGCAATGGCTAGTGATGGTACTTGGCAAGGAAAACAGT 
ATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGTTTTTGATACTCATTATCAATCTTGGTTCTATATAAAA 
GCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCAAGGTGACGACTATTTTTACCTCAAATCTGGTGGCTATAT 
GGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTTTATTATCTTGACCAAGATGGAAAGATGAAAAGAAATGCTT 
GGGTAGGAACTTCCTATGTTGGTGCAACAGGTGCCAAAGTAATAGAAGACTGGGTCTATGATTCTCAATACGATGCT 
TGGTTTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTCCAAATTAAAGGGAAGGACTATTATTTCAA 
ATCCGGTGGTTATCTACTGACAAGTCAGTGGATTAATCAAGCTTATGTGAATGCTAGTGGTGCCAAAGTACAGCAAG 
GTTGGCTTTTTGACAAACAATACCAATCTTGGTTTTACATCAAAGAAAATGGAAACTATGCTGATAAAGAATGGATT 
TTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCTACATGGCAGCCAATGAATGGATTTGGGATAAGGAATC 
TTGGTTTTATCTCAAATTTGATGGGAAAATGGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACT 
ACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATCTGAT 
GGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACAT 
GACAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAAT 
GGGTCTACGATTCTCATAGTCAAGCTTGGTACTACTTCAAATCTGGTGGCTACATGGCGAAAAATGAGACAGTAGAT 
GGTTATCAGCTTGGAAGCGATGGTAAATGGCTTGGAGGAAAAACTACAAATGAAAATGCTGCTTACTATCAAGTAGT 
GCCTGTTACAGCCAATGTTTATGATTCAGATGGTGAAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAG 
ATAAGGATAGAAAAAGTGATGACAAGCGCTTGGCTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGAT 
TTACAAGCGCTAGATGCTAGTAAGGACTTTATCCCTTATTATGAGAGTGATGGCCACCGTTTTTATCACTATGTGGC 
TCAGAATGCTAGTATCCCAGTAGCTTCTCATCTTTCTGATATGGAAGTAGGCAAGAAATATTATTCGGCAGATGGCC 
TGCATTTTGATGGTTTTAAGCTTGAGAATCCCTTCCTTTTCAAAGATTTAACAGAGGCTACAAACTACAGTGCTGAA 
GAATTGGATAAGGTATTTAGTTTGCTAAACATTAACAATAGCCTTTTGGAGAACAAGGGCGCTACTTTTAAGGAAGC 
CG AAG AAC ATT AC CAT AT C AATG C TC TTT AT CTC C T TG C C C AT AGTG C C C TAG AAAGT AACTGGGGAAGAAG T AAAA 
TTGCCAAAGATAAGAATAATTTCTTTGGCATTACAGCCTATGATACGACCCCTTACCTTTCTGCTAAGACATTTGAT 
GATGTGGATAAGGGAATTTTAGGTGCAACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGG 
AAACAAGGCTTCTGGTATGAATGTGGAATATGCTTCAGACCCTTATTGGGGCGAAAAAATTGCTAGTGTGATGATGA 
AAATCAATGAGAAGCTAGGTGGCAAAGATTAG 



I^LGEFWYNKINKNRGRRLMKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWVFDTHYQSWFYIK 
ADANYAENEWLKQGDDYFYLKSGGYMAKSEOTEDKGAFYYLDQDGK^ 

WFYIKADGQHAEKSWLQIKGKDYYFKSGGYLLTSQWINQAYWASGAKVQQGWLFDKQYQSWFYIKENGNYADKEWI 
FENGHYYYLKSGGYIW^EWIWDKESWFYLKFDGKI^EKEWVYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKS^ 
GKIAEKEOTYDSHSQAWYYFKSGGYMTANEWIWDK^SWFYLKSDGKIAEKEWYDSHSQAWYYFKSGGYMAKNETVD 
GYQLGSDGKWLGGKTTNENAAYYQWPVTAOTYDSDGEKXj^^ 

LQALDASKDFIPYYESDGHRFYHYVAQNASI PVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAE 
ELDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDTTPYLSAKTFD 
DVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKDZ 

ID205 - 4118.1 

ATGAAAAAATTAGGTACATTACTCGTTCTCTTTCTTTCTGCAATCATTCTTGTAGCATGTGCTAGCGGAAAAAAAGA 
TACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGCTGATATTACTAAAAATATTGCTGGTG 
ACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCCACACGAATACGAACCACTTCCTGAAGACGTTAAG 
AAAACTTCTGAGGCTAATTTGATTTTCTATAACGGTATCAACCTTGAAACAGGTGGCAATGCTTGGTTTACAAAATT 
GGTAGAAAATGCCAAGAAAACTGAAAACAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAG 
GT C AAAATGAAAAAGGAAAAG AAG AC CC AC AC GCTTGGCTT AAC CTTGAAAACGGT ATT AT TTTTGCTAAAAAT AT C 
GCCAAACAATTGAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAATATACTGATAAGTT 
AGACAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAGCGAAG 
GAGCATTCAAATACTTCTCTAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATACTGAAGAAGAAGGA 
ACTCCTGAACAAATCAAGACCTTGGTTGAAAAACTTCGCCAAACAAAAGTTCCATCACTCTTTGTAGAATCAAGTGT 
GGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAATCTACGCTCAAATCTTTACTGACTCTATCG 
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CAGAACAAGGTAAAGAAGGCGACAGCTACTACAGCATGATGAAATACAACCTTGACAAGATTGCTGAAGGATTGGCA 
AAATAA 



MKKLGTLLVLFLSAIILVACASGKKDTTSGQKLKVVATNSIIADITKNIAGDKIDLHSIVPIGQDPHEYEPLPEDVK 
KTS E AN L I F YNG I N L E TGGNAW FT KL VEN A KKT EN KD Y F AV S D G VD V IYLEGQNE KG KE D P HAW LN LE NG 1 1 F AKN I 
AKQLSAKDPNNKEFYEKNLKEYTDKLDKLDKESKDKFNKIPAEKKLIVTSEGAFKYFSKAYGVPSAYIWEINTEEEG 
TPEQIKTLVEKLRQTKVPSLFVESSVDDRPMKTVSQDTNIPIYAQIFTDSIAEQGKEGDSYYSMMKYNLDKIAEGLA 
KZ 

ID206 - 4119.1 

ATGGAATGGTATAAAAAAATCGGACTTCTTGCAACTACAGGTTTAGCTTTGTTTGGGCTCGGCGCTTGCTCCAACTA 
TGGTAAATCTGCGGATGGCACAGTGACCATCGAGTATTTCAACCAGAAAAAAGAAATGACCAAAACCTTGGAAGAAA 
TCACTCGTGATTTTGAGAAGGAAAACCCTAAGATCAAGGTCAAAGTCGTCAATGTACCAAATGCTGGTGAAGTATTG 
AAGACACGCGTTCTCGCAGGAGATGTGCCTGATGTGGTCAATATTTACCCACAGTCCATCGAACTGCAAGAATGGGC 
AAAAGCAGGTGTTTTTGAAGATTTGAGCAACAAAGACTACCTGAAACGCGTGAAAAATGGCTACGCTGAAAAATATG 
CTGTAAACGAAAAAGTTTACAACGTTCCTTTTACAGCTAATGCTTATGGAATTTACTACAACAAAGATAAATTCGAA 
GAACTGGGCTTGAAGGTTCCTGAAACCTGGGATGAATTTGAACAGTTAGTCAAAGATATCGTTGCTAAAGGACAAAC 
ACCATTTGGAATTGCAGGTGCAGATGCTTGGACACTCAATGGTTACAATCAATTAGCCTTTGCGACAGCAACAGGTG 
GAGGAAAAGAAGCAAATCAATACCTTCGTTATTCTCAACCAAATGCCATTAAATTGTCGGATCCGATTATGAAAGAT 
GATATCAAGGTCATGGACATCCTTCGCATCAATGGATCTAAGCAAAAGAACTGGGAAGGTGCTGGCTATACCGATGT 
TATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGATCACAGCGATTAATGAACAAA 
AACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGGACAAAGCTTAACCGTTGGTGCGGGAGAC 
TTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGAAGCCAATGCCTTTGTGGAATATATGACCCGTCCAGA 
AGT C ATG C AAAAAT ACT ACG ATGTGG ACGG AT C T C C AAC AG CG AT CG AAGGGGTC AAAC AAG C AGG AG AAG AT T C AC 
CGCTTGCTGGTATGACCGAATATGCCTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACTGGACCAGTGAAGCA 
GACTTCCATACCTTGACCATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTT 
T AAC C CG ATG AAAG CGG AT GTGGAT TAG 



MEWYKKIGLLATTGIALFGLGACSNYGKSADGTVTIEYFNQKKEMTKTLEEITRDFEKJSNPKIKVKOTNVPNAGEVL 
KTRVLAGDVPDVVNI YPQSIELQEWAKAGVFEDLSNKDYLKRVKNGYAEKYAVNEKVYNV 

ELGLKVPETWDEFEQLVKDIVAKGQTPFGIAGADAWTLNGYNQLAFATATGGGKEANQYLRYSQPNAIKLSDPIMKD 
DIKVMDILRINGSKQKNWEGAGYTDVIGAFARGDVLMTPNGSWAITAINEQKPNFKIGTFMIPGK^KGQSLTVGAGD 
LAWS I S ATTKHPKEANAFVE YMTRPEVMQKY YDVDGS PTAI EGVKQAGEDS PLAGMTE YAFTDRHLVWLQQ YWTS EA 
DFHTLTMNYVLTGDKQGMVNDLNAFFNPMKADVDZ 

ID207 - 4123.1 

ATG AAG AAAAT C AAAC C G C ATGGAC CGTT AC C AAGT C AG ACT C AG CT AG CTT AT C T GGGAG AT G AACT AG C AG C T T T 
TATCCACTTCGGTCCTAATACCTTTTATGACCAAGAATGGGGGACTGGACAGGAGGATCCTGAGCGCTTTAACCCGA 
GTCAGTTGGATGCGCGTGAGTGGGTTCGTGTGCTCAAGGAAACGGGCTTCAAAAAGTTGATTTTGGTGGTCAAGCAC 
CACGATGGCTTTGTCCTTTATCCGACAGCTCACACAGATTATTCGGTTAAGGTCAGTCCTTGGAGGAGAGGAAAGGG 
CGACTTGCTCCTTGAAGTATCCCAAGCTGCCACAGAGTTTGATATGGATATGGGGGTCTACCTGTCACCGTGGGATG 
CCCATAGTCCCCTCTATCATGTGGACCGAGAAGCGGACTACAATGCCTATTATCTGGCTCAGTTGAAGGAAATCTTA 
TCAAATCCTAACTATGGGAATGCTGGTAAGTTCGCTGAGGTTTGGATGGATGGTGCCAGAGGAGAGGGCGCGCAAAA 
GGTTAATTATGAATTTGAAAAATGGTTTGAAACCATTCGTGACCTGCAGGGCGATTGCTTGATTTTTTCAACAGAAG 
GCACCAGTATCCGCTGGATTGGCAATGAACGAGGGTATGCAGGTGATCCACTGTGGCAAAAGGTGAATCCTGATAAA 
CTAGGAACAGAAGCAGAGCTGAACTATCTTCAGCACGGGGATCCCTCGGGCACGATTTTTTCAATCGGAGAGGCAGA 
TGTTTCCATCCGTCCAGGCTGGTTCTACCATGAGGATCAGGATCCTAAGTCTCTCGAGGAGTTGGTCGAAATCTACT 
TTCACTCAGTAGGGCGAGGAACTCCACTCTTGCTTAATATTCCGCCGAATCAAGCTGGGCTCTTTGATGCAAAGGAT 
ATTGAACGACTTTATGAATTTGCGACCTATCGCAATGAGCTCTATAAAGAAGATTTGGCTCTGGGAGCTGAGGTATC 
TGGTCCAGCTCTTTCCGCAGACTTTGCTTGTCGCCATTTGACAGACGGCCTTGAGACCAGCTCTTGGGCAAGCGATG 
CAGACTTGCCCATCCAGTTAGAACTCGACTTAGGTTCTCCTAAAACTTTTGATGTAATTGAGTTAAGAGAAGATTTG 
AAGCTAGGGCAACGAATCGCTGCTTTTCATGTGCAAGTAGAGGTGGATGGTGTCTGGCAGGAGTTTGGTTCGGGTCA 
TACTGTTGGTTACAAACGTCTCTTACGAGGAGCAGTTGTTGAGGCACAGAAGATACGTGTAGTCATTACAGAATCAC 
AGGCTTTGCCTTTGTTGACCAAGATTTCCCTTTATAAAACTCCTGGATTATCAAAAAAAGAAGTTGTTCAGGAACTA 
GCATTTGCAGAAAAAAGCCTAGCTGTGGCAAAGGGAGAAAATGCCTATTTTACAGTTAAGCGCAGAGAATGTAGTGG 
T C CTTT AG AAG CT AAG ATTT CG AT T C AAC C GGGG AC AGGTG T C C ATGGTG T CG C C TAT C AGG ATG AG ATT C AAGT C C 
TTGCGTTTCAAACTGGTGAGACTGAAAAAAGTCTGACGCTACCAACCTTGTATTTCGCAGGAGATAAAACCTTGGAT 
TTCTATCTGAACCTAACGGTGGATGGTCAGCTTGTGGATCAACTTCAAGTCCAAGTTTCATAA 
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MKKIKPHGPLPSQTQLAYLGDELAAFIHFGPNTFYDQEWGTGQEDPERFNPSQLDAREWVRVLKETGFKKLILVVKH 

HDGFVL YPTAHTD YSVKVS PWRRGKGDLLLE VSQAATE FDMDMGVYLS PWDAHS PL YHVDREAD YNAYYLAQLKE I L 
SNPNYGNAGKFAEVWMDGARGEGAQKVNYEFEKWFETIRDLQGDCLIFSTEGTSIRWIGNERGYAGDPLWQKVNPDK 
5 LGTEAELNYLQHGDPSGTI FS IGEADVS IRPGWFYHEDQDPKSLEELVEI YFHSVGRGTPLLLNI PPNQAGLFDAKD 

IERLYEFATYRNELYKEDLALGAEVSGPALSADFACRHLTDGLETSSWASDADLPIQLELDLGSPKTFDVIELREDL 
KLGQRIAAFHVQVEVDGVWQEFGSGHTVGYKRLLRGAVVEAQKIRWITESQALPLLTKISLYKTPGLSKKEVVQEL 
AFAEKSLAVAKGENAYFTVKRRECSGPLEAKIS IQPGTGVHGVAYQDEIQVLAFQTGETEKSLTLPTLYFAGDKTLD 

FYLNLTVDGQLVDQLQVQVS Z 

10 

ID208 - 4125.12 

ATGCTTGAAAGACTGAAAAGAATACATTATATGTTTTGGATCAGTTTAATTTTTATGATTTTCCCCATCCTGTCTGT 
AGTGACTGGGTGGCTTTCTGCCTGGCATTTATTGATTGATATTCTATTTGTAGTGGCATATTTGGGTGTTTTAACAA 
CTAAGAGCCAGCGCCTATCTTGGCTATATTGGGGCCTCATGCTGACTTATGTAGTTGGGAATACTGCCTTTGTTGCT 

15 GTTAATTATATCTGGTTTTTCTTTTTCCTATCCAATCTCTTAAGTTATCATTTCAGCGTACGTAGTTTAAAGTCTTT 
ACATGTCTGGACTTTTCTTCTTGCTCAAGTCCTTGTTGTGGGGCAACTGTTGATTTTTCAGAGAATCGAAGTTGAGT 
TTCTATTCTATCTACTTGTAATTCTTACTTTTGTCGATTTAATGACTTTTGGATTGGTTCGGATTCGTATTGTCGAG 
GATTTGAAAGAAGCTCAGGTCAAGCAAAATGCTCAGATAAATCTATTGCTTGCTGAAAATGAACGTAGTCGTATCGG 
TCAGGATTTGCATGATAGTCTGGGACATACCTTTGCTATGCTGAGTGTCAAGACAGATTTAGCCTTGCAGTTATTTC 

20 AGATGGAGGCTT ATC CACAGGTGGAAAAGGAATT AAAAGAAATTCAC CAGAT CAGCAAGGATCCATGA 



MLERLKRIHYMFWISLIFMIFPILSWTGWLSAWHLLIDILFVVAYLGVLTTKSQRLSWLYWGLMLTYWGNTAFVA 
VNYIWFFFFLSNLLSYHFSVRSLKSLHVWTFLLAQVLWGQLLIFQRIEVEFLFYLLVILTFVDLMTFGLVRIRIVE 
25 DLKEAQVKQNAQINLLLAENERSRIGQDLHDSLGHTFAMLSVKTDIALQLFQMEAYPQVEKELKEIHQISKDPZ 

ID209 - 4126.3 

ATGAATGATAAGTTAAAAATCTTCTTGTTGCTAGGAGTATTTTTTCTAGCCATAACCGGTTTCTATGTTCTATTGAT 
ACGAAAT G C AGGG CAG AC AGATG C CT CG C AAAT TG AAAAGG CGG C AGT TAG C C AAGGAGG AAAAG C AGTG AAAAAAA 

30 CAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTC 
TCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGGAGAAACAACCAAGTACGAATTGAT 
TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAATCCTGCTTC 
ACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCAGTACCGTACTGGTGTT 
TATTACACAGATGACAAGGATTTGGAAGTGATTAACCAAGTCTTTGATGAGGTGGCTAAGAAATACGATCAACCTCT 

35 AGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGATTACCATCAAGACTATCTCAAGAAAAATCCAA 
ATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGAT 
GAGGAATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCAAA 
CCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACA 
AATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAG 

40 TCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGGTCC 
AC AGG AC AAGGG CGG CTT ACGT T AC TGT AT C AAT AG C C T CT CT AT CCG CT TT AT T C C C AAAG AC C AAATGG AAG AAA 
AAGgcTACGCTTATTTACTAGATTATGTTGATTAA 

45 MNDKLKI FLLLGVFFLAI TGFYVLL I RNAGQTDASQ I E KAAVSQGGKAVKKTE I S KDADLHE I YLAGGCFWGVEE YF 

SRVPGVTDAVSGYANGRGETTKYELINQTGHAETVHVTYDAKQISLKEILLHYFRI INPTSKNKQGNDVGTQYRTGV 
YYTDDIODLEVINQVFDEVAKKYDQPIAVEKENLKNFVVAEDYHQDYLKKNPNGYCHINVNQAAYPVIDASKYPKPSD 
EELKKTLS PEE YAVTQENQTERAFSNRYWDKFESGI YVDI ATGE PLFS S KDKFESGCGWPS FTQPI S PDWTYKEDK 
SYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLS IRFI PKDQMEEKGYAYLLDYVDZ 

50 

ID210 - 4127.1 

ATGAAAAAGAAATGGATGTATTATGCTGCTTGTTCTTCTAATGAATCTGCCGATGACAGTTCATCTGATAAAGGAGA 

CGGCGG T T CG C T AGT CGTT T ATT C AC C AAAC T CAG AGGG C T T AATTGG AG C AAC T ATT C C TGC C TTTG AAG AAAAAT 
ATGGTATCAAAGTAGAACTGATTCAAGCTGGTACTGGAGAACTTTTCAAAAAACTAGAGTCAGAAAAAGAAGTTCCT 

55 GTAGCTGATGTTATCTTTGGTGGTTCTTATACACAATATACTACCCACGGAGAACTCTTTGAAAACTATACTTCAAA 
AG AAAAT GAT AATGT T AT C AAAG AAT AT C AAAAC AC AAC TGG C T ACT CT ACT C CTT AT AC AC T AGAT GGT AGTGT T T 
TAATCGTCAACCCTGATTTAACTAAAGGCATGAACATCGAAGGATATAACGATCTTTTCAAACCTGAACTAAAAGGA 
AAAATCGCAACTGCTGACCCAGCAAACTCTTCTAGCGCCTTTGCTCAATTAACAAATATGCTACAAGCTCAAGGTGG 
TT AC AAAG ATGAT AAGG CTTGGTC TT AT GT AAAAG AT C T TTT C AC ACT TATTG AT GGT AAAAT C GG T T C AAGTT C AT 

60 CTAGTGTCTATAAAGTAGTCGCTGATGGAGAAATGGCTGTTGGTCTCTCTTATGAAGATCCAGCAGTTAAACTCTTA 
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AATGACGGAGCTAACATTAAGGTAGTCTATCCAAAAGAAGGAACCGTCTTCCTACCTGCTAGTGCTGCTATCGTTAA 
AAAATCTAAAAATATGGAAAATGCCAAGAAATTTATCGATTTTATTATCTCTCAAGAAGTACAAGATACACTTGGTA 
CAACCACTACTAACCGTCCTGTTCGTAAAAATGCTAAAACAAGCGAAAACATGAAACCAATTGACAAAATCAAAACA 
CTCACTGAAGATTATGATTATGTCATCAAGAATAAATCAGATATCGTTAAGAAATACAACGAAGTCTTTACAGATAT 
CCAATCTAAACAGTAA 



MKKKWMYYAACSSNESADDSSSDKGDGGSLWYSPNSEGLIGATI PAFEEKYGIKVELIQAGTGELFKKXjESEKEVP 

VADVI FGGS YTQYTTHGELFENYTS KENDNVI KE YQNTTGYSTP YTLDGS VL I VNPDLTKGMNI EGYNDLFKPELKG 

KIATADPANSSSAFAQLTNMLQAQGGYKDDKAWSWKIDLFTLI^ 

NDGANIKWYPKEGWFLPASAAIVKKSKNMENAKK^ 

LTEDYDYVIKNKSDIVKKYNEVFTDIQSKQZ 

ID211 - 4127.2 

ATGAGTGAGATCAAAATTATTAACGCCAAAAAAATCTACCACGATGTCCCTGTTATTGAGAATTTGAACATTACAAT 
TCCAAAAGGAAGTCTCTTTACCCTTCTTGGAGCTTCAGGATGTGGGAAAACGACCCTTCTTCGTATGATTGCAGGTT 
TCAACAGTATCGAAGGTGGAGAATTTTACTTCGATGATACAAAAATCAATAATATGGAACCCAGCAAACGCAATATC 
GGGATGGTTTTCCAAAACTACGCTATTTTCCCACATTTGACTGTCCGAGACAACGTTGCTTTTGGTCTTATGCAAAA 
GAAGGTTCCAAAAGAAGAATTGATTCAACAGACCAACAAGTATCTTGAACTCATGCAAATTGCTCAATATGCGGATC 
GAAAGCCCGATAAACTCAGTGGTGGACAACAACAACGTGTCACCTTGGCATGCGCCTTAGCGGTTAATCCAAGTGTT 
CTCCTCATGGACGAGCCACTTAGTAATCTGGAGGCCAAACTTCGCTTGGATATGCGTCAAGCCATCCGAGAAATCCA 
ACACGAAGTGGGAATTACAACTGTTTATGTAACCCACGACCAAGAAGAAGCCATGGCTATTTCAGACCAAATTGCTG 
TT AT G AAAG AT GGGGTG AT C C AAC AAAT CGG C CG AC C AAAAG AACT C TAT C AT AAAC C AG CT AATG AGTTTGTGG C A 
ACCTTTATCGGACGCACAAATATTATCCCTGCCAATCTTGAAAAACGGAGCGACGGCGCTTATATCGTCTTTTCAGA 
TGGCTATGCCCTTCGAATGCCAGCTCTTGATCAGGTTGAGGAGCAAGCTATTCATGTAAGCATTCGTCCCGAAGAGT 
TTATCAAAGATGAATCTGGAGATATTGAAGGAACTATTAGAGATAGCGTCTATCTTGGACTAAATACGGATTATTTC 
ATTGAGACAGGTTTTGCCTCAAAAATTCAAGTTAGTGAAGAATCAACTTTTGAAGAAGATCTACAAAAAGGCAATCG 
TATTCGTCTACGAATCAATACGCAAAAATTAAACATCTTTTCTGCAGATGGTTCCCAAAACCTGATAAAAGGAGTCA 
ACCATGGAACGTAA 

MSEIKI INAKKI YHDVPVIENLNITIPKGSLFTLLGASGCGKTTLLRMIAGFNSIEGGEFYFDDTKINNMEPSKRNI 
GIWFQNYAIFPHLTVRDOTAFGLMQKKVPKEELIQQ 

LLMDEPLSNLEAKLRLDMRQAIREIQHEVGITTVWTHDQEEAI^ISDQIAVMKDGVIQQIGRPKELYHKPANEFVA 
TFIGRTNI IPANLEKRSDGAYIVFSDGYALRMPALDQVEEQAIHVS IRPEEFIKDESGDIEGTIRDSVYLGLNTDYF 
IETGFASKIQVSEESTFEEDLQKGNRIRLRINTQKLNIFSADGSQNLIKGVNHGTZ 

ID212 - 4136.1 

ATGAAGAAAAAATTATTGGCAGGTGCCATCACACTATTATCAGTAGCAACTTTAGCAGCTTGTTCGAAAGGGTCAGA 
AGGTGCAGACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTATGAGCAAGTGAAAAGCAACCCTT 
C AG C C C AAC AAGT C T TGTT AAAT ATG AC CAT C CAAAAAGTTTT T G AAAAAC AAT AT GG C T C AGAG C TTG AT GAT AAA 
GAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATATGGCGAAAACTACCAACGTGTCTTGTCACAAGCAGGTAT 
GACTCTTGAAACACGTAAAGCTCAAATTCGTACAAGTAAATTAGTTGAGTTGGCAGTTAAGAAGGTAGCAGAAGCTG 
AATTGACAGATGAAGCCTATAAGAAAGCCTTTGATGAGTACACTCCAGATGTAACGGCTCAAATCATCCGTCTTAAT 
AATGAAGATAAGGCCAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGCCAAAGATAA 
TTCAACTGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAACAGAAGTACCTGAGCAAG 
TCAAAAAAGCCGCTTTCGCTTTAGATGTGGATGGTGTTTCTGATGTGATTACAGCAACTGGCACACAAGCCTACAGT 
AGC C AAT ATT AC AT TGT AAAACT C ACT AAG AAAACAG AAAAAT CAT C T AAT ATTG ATG ACT AC AAAG AAAAAT T AAA 
AACTGTTATCTTGACTCAAAAACAAAATGATTCAACATTTGTTCAAAGCATTATCGGAAAAGAATTGCAAGCAGCCA 
ATATCAAGGTTAAGGACCAAGCCTTCCAAAATATCTTTACCCAATATATCGGTGGTGGAGATTCAAGCTCAAGCAGT 
AGT AC AT C AAACG AAT AG 

MKKKLLAGAITLLSVATLAACSKGSEGADLISMKGDVITEHQFYEQVKSNPSAQQVLLNMTIQKVFEKQYGSELDDK 
EVDDTIAEEKKQYGENYQRVLSQAGMTLETRKAQIRTSKLVELAVKKVAEAELTDEAYKKAFDEYTPDVTAQIIRLN 
NEDKAKEVLEKAKAEGADFAQLAKDNSTDEKTKENGGEITFDSASTEVPEQVKKAAFALDVDGVSDVITATGTQAYS 
SQYYIVKLTKKTEKSSNIDDYKEKLKTVILTQKQNDSTFVQSIIGKELQAANIKA/KDQAFQNIFTQYIGGGDSSSSS 
STSNEZ 
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ID213 - 4137.3 

ATGAAAAAAAATATTAAACAATATGTAACCTTAGGTACTGTAGTGGTATTATCAGCATTTGTTGCTAACTCAGTTGC 
AGCTCAGGAGACTGAAACTTCTGAAGTATCAACACCAAAGTTGGTGCAACCTGTTGCACCAACGACTCCGATTTCGG 
AAGTACAACCTACATCGGATAACTCTTCGGAAGTTACTGTACAACCTCGAACAGTTGAAACTACTGTTAAGGATCCA 
TCTTCTACAGCGGAAGAAACTCCTGTCTTAGAAAAAAATAATGTTACTTTAACAGGGGGCGGAGAAAATGTTACTAA 
AGAGTTAAAGGATAAATTTACTAGCGGTGACTTTACTGTAGTGATTAAGTACAATCAGTCAAGTGAGAAAGGCTTAC 
AAGCTCTGTTTGGAATATCTAATTCCAAACCCGGTCAACAAAATAGTTATGTAGATGTGTTCCTTAGAGACAATGGT 
GAGTTGGGGATGGAAGCGCGTGATACTTCTTCCAATAAAAATAACCTAGTATCCAGACCTGCTTCAGTTTGGGGTAA 
GTACAAACAAGAGGCTGTGACTAACACTGTTGCAGTAGTAGCAGATTCAGTCAAAAAAACATATTCTTTATACGCAA 
ATGGTACAAAAGTAGTAGAAAAGAAAGTGGATAATTTCCTAAACATCAAGGATATTAAAGGTATTGATTACTATATG 
CTTGGGGGAGTGAAACGTGCAGGAAAAACGGCGTTTGGTTTTAACGGAACACTAGAAAATATCAAATTCTTTAATAG 
TGCATTGGATGAAGAAACTGTTAAAAAGATGACAACAAACGCTGTTACTGGACATTTAATTTATACGGCTAATGATA 
CAACAGGTTCTAACTATTTCCGTATTCCAGTTCTGTATACTTTTAGCAATGGTCGGGTATTTTCAAGCATTGACGCT 
CGTTACGGTGGAACTCATGATTTCTTGAATAAAATTAATATTGCTACAAGTTATAGTGATGATAATGGTAAGACATG 
GACTAAACCAAAATTAACATTGGCATTCGATGATTTTGCGCCAGTACCATTAGAATGGCCTCGTGAAGTTGGTGGAC 
GTGACTTACAAATCAGCGGTGGTGCAACCTATATTGACTCTGTTATTGTTGAAAAAAAGAACAAACAAGTACTCATG 
TTTGCTGATGTGATGCCTGCTGGAGTAAGTTTTAGAGAAGCAACTAGAAAAGATTCAGGTTATAAACAAATTGATGG 
TAATTATTACCTTAAATTAAGGAAACAAGGTGATACTGATTACAATTATACTATTCGTGAGAATGGTACTGTATACG 
ACGATCGTACCAACAGACCAACTGAATTTTCAGTAGATAAAAATTTCGGTATTAAACAAAATGGTAATTATTTGACG 
GTAGAGCGG 

MKKNIKQYVTLGTVVVLSAFVANSVAAQ 

S STAEETPVLEKNNVTLTGGGENVTKELKDKFTSGD FTW I KYNQS S EKGLQALFGI SNS KPGQQNS YVDVFLRDNG 
ELGMEARDTSSNKNNLVSRPASWGKYKQEAVTNTVAWADSVK^^ 

LGGVKRAGKTAFGFNGTLENIKFFNSALDEETVKKMTTNAVTGHLIYTANDTTGSNYFRIPVLYTFSNGRVFSSIDA 
RYGGTHDFLNKINIATSYSDDNGKTWTKPKLTLAFDDFAPVPLEWPREVGGRDLQISGGATYIDSVIVEKKNKQVLM 
FADVMPAGVS FREATRKDSGYKQIDGNYYLKLRKQGDTDYNYTIRENGTVYDDRTNRPTEFSVDKNFGIKQNGNYLT 
VER 

ID214 - 4185 

ATGAAAAAATTTAGCCTATTACTAGCTATCCTACCATTTTTGGTTGCCTGTGAGAATCAAGCTACACCCAAAGAGAC 
TAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGTGCCACCATTTGACTACGAAGACAAGGGCAATCTGA 
CAGGCTTTGATATCGAAGTTTTAAAGGCAGTAGATGAAAAACTCAGCGACTACGAGATTCAATTCCAAAGAACCGCC 
TGGGAGAGCATCTTCCCAGGACTTGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCG 
TGCTGAAAAATACCTTTACTCGCTTCCAATTTCCAACAATCCCCTCGTCCTTGTCAGCAACAAGAAAAATCCTTTGA 
CTTCTCTTGACCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACTGG 
AATCAGAAACACACTGATAATCCCGCTACAATTAATTTTTCTGGTGAGGATATTGGTAAACGAATCCTAGACCTTGC 
TAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCGTGGTTTAGACCTCT 
CAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAGCGACCAAAAAGAGTTTAAAGAG 
CAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGAAAAACTCAGCAATACCTATCTAGGTGGTTC 
T T A C C T C C C AG AT C AAT CT C AG T T AC AAT AA 



MKKFSLLLAILPFLVACENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTA 
WESIFPGLDSGHYQAAANNLSYTKERAEKYLYSLPISNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAQFINNW 
NQKHTDNPATINFSGEDIGKRILDLANGEFDFLVFDKVSVQKI IKDRGLDLSVVDLPSADSPSNYI IFSSDQKEFKE 
QFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLQZ 

ID215 - 4211.1 

ATGAAAAAAAATAGTTTATATATCATATCCTCACTCTTTTTTGCTTGTGTCTTATTTGTCTATGCTACGGCGACGAA 
T T TT C AAAAC AGT AC C AGTG CT AGG C AG GT AAAAAC GG AAAC CT AT ACT AAT AC AGT AAC AAATGT C C CT AT TG AC A 
TACGCTATAATAGTGATAAGTATTTTATTAGCGGTTTTGCTTCAGAAGTATCAGTGGTCTTGACTGGTGCAAATCGC 
CTATCGCTAGCTAGTGAAATGCAAGAAAGTACACGTAAATTCAAGGTTACTGCTGACCTAACAGATGCCGGTGTTGG 
AACGATTGAAGTTCCTTTGAGCATTGAAGATTTACCCAATGGGCTGACCGCTGTGGCGACTCCGCAAAAAATTACAG 
TCAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTGTACCAGAGATTGACCCTAGTCAAATTGATAGTCGG 
GTACAAATTGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTACGAGTGACCAAGAGACATTGGATAGAATTGA 
TAAGATTATCGCTGTTTTGCCAACTAGCGAACGTATAACAGGTAATTACAGTGGTTCAGTACCTTTGCAGGCAATCG 
AC CG C AAT GGT GTTGT C T T AC C GG C AGT TAT C AC T C C GTTTGAT AC AAT AAT G AAGG TG AC T AC AAAAC C AGT AG C A 
C C AAGTT C AAG C AC AT C AAAT T C AAGT AC AAG C AGT T CAT CGGAG AC AT CT T CGT C AACG AAAG C AACT AGTT C AAA 
AACGAATTAA 
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MKKNSLYIISSLFFACVLFVYATATNFQNSTSARQVKTETYTNTVTNVPIDIRYNSDKYFISGFASEVSVVLTGANR 
LSIJ^EMQESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAQKDBCVKIVPEIDPSQIDSR 
VQIENVMVSDKEVSITSDQETLDRIDKI IAVLPTSERITGNYSGSVPLQAIDRNGWLPAVITPFDTIMKVTTKPVA 
5 PSSSTSNSSTSSSSETSSSTKATSSKTNZ 

ID216 - 4127.3 

ATGTTGATTGGCGAAGGGTATCGGACTTTCCCTGTCCTGATTTATACCCAATTTATTAGCGAGGTTGGAGGAAATTC 
TGCTTTTGCAATTATGGCGATTATCATTGCCTTGGCAATTTTCCTTATCCAAAAACACATTGCAAACCGCTACAGTT 

10 TCAGCATGAATCTGCTCCATCCAATTGAGCCTAAAAAAACTACAAAAGGAAAAATGGCTGCCATTTATGCAACAGTC 
TACGGAATTATCTTTATCTCTGTTTTACCTCAAATCTACTTAATTTATACCTCTTTCCTAAAAACATCAGGTATGGT 
ATCTGTTAAAGGTTATTCTCCAAACAGTTACAAGGTAGCTTTCCATCGTATGGGATCTGCTATTTTCAATACCATTC 
GTATCCCTTTGATTGCCTTAGTTCTAGTTGTTCTATTTGCGACATTTATCTCCTACCTAGCCGTTAGAAAACGGAAT 
TTGTTTACAAACTTAATTGACAGCCTCAGTATGGTACCTTATATTGTACCAGGAACCGTTCTAGGGATTGCCTTCAT 

15 TTCTTCCTTCAATACTGGTCTATTTGGAAGTGGATTTCTTATGATTACAGGGACTGCTTTCATCTTGATTAT.GTCTC 
TATCTGCCAGAAGATTACCTTATACTATTCGCTCATCTGTTGCTAGCTTACAACAAATAGCACCAAGTATTGAAGAA 
GCTGCTGAAAGCTTAGGAAGTAGTCGTCTCAATACCTTTGCTAAGATTACAACTCCAATGATGCTATCTGGTATCAT 
TTCTGGAGCCATCTTATCTTGA 

20 

MLIGEGYRTFPVLIYTQFISEVGGNSAFAIMAI I IALAIFLIQKHIANRYSFSI^LLHPIEPKKTTKGKMAAIYATV 
YGIIFISVLPQIYLIYTSFLKTSGWSVKGYSPNSYKVAFHRMGSAIFNTIRIPLIALVLVVLFATFISYLAVRKRN 
LFTNLIDSLSMVPYIVPGTVLGIAFISSFNTGLFGSGFLMITGTAFILIMSLSARRLPYTIRSSVASLQQIAPSIEE 
AAESLGSSRLNTFAKITTPMMLSGIISGAILSZ 

25 
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Table 4 
ID301 

ATGAATAAGAAAAAAATGATTTTAACAAGTCTAGCCAGCGTCGCTATCTTAGGGGCTGGTTTTGTTACGTCTCAGCC 
TACTTTTGT AAGAG C AG AAG AATCT C C AC AAGT TGT CGAAAAAT C TT CATT AGAG AAG AAAT ATGAGGAAGC AAAAG 
CAAAAG CTGATACTGC CAAGAAAG ATT ACGAAACGGC T AAAAAGAAAG C AGAAG ACG C T C AGAAAAAGT AT GAAG AT 
GATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGAAGCATCTCAAAAATTGAATGATGTGGCGCTTGTTGT 
TC AAAATG CAT ATAAAGAGT AC CG AGAAGTT C AAAAT C AACGT AG T AAAT AT AAATCTGACGCTGAAT AT C AG AAAA 
AATTAACAGAGGTCGACTCTAAAATAGAGAAGGCTAGGAAAGAGCAACAGGACTTGCAAAATAAATTTAATGAAGTA 
AGAGCAGTTGTAGTTCCTGAACCAAATGCGTTGGCTGAGACT AAG AAAAAAGCAG AAG AAG CTAAAGCAGAAG AAAA 
AGTAG CT AAGAGAAAAT ATGATT ATGCAACT CT AAAGGT AG C ACT AG CGAAG AAAGAAGT AGAGGCT AAGGAACTTG 
AAATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACTGCTCAACATCAAGTAGATAATTTG 
AAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCTAAATTAAAAAAAGGAGAAGCTGA 
GCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTCTTGACAGCCTTGATCCTGAAG 
GTAAG AC T CAGGATGAAT T AGAT AAAGAAGCAG AAGAAG CTG AGTTGGAT AAAAAAG CTG ATGAACTT CAAAATAAA 
GTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATCCTGAAGATGATACTGCTGC 
TCTTCAAAATAAATTAGCTGCTAAAAAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTCTTGACAGCC 
TTGATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTTGGATAAAAAAGCTGATGAA 
CTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATTCTGAAGA 
TGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGCTGAATTGGAAAAAACTCAAAAAGAATTAGATGCAG 
CTCTTAATGAGTTAGGCCCTGATGGAGATGAAGAAGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCAGCTCCT 
G CAC C AAAAC C AGAGC AAC C AG CTCC AG CT C C AAAAC C AGAGC AAC C AGCT C CTGCAC C AAAAC CAGAGC AAC CAG C 
T C C AG C T C C AAAAC C AGAG C AAC C AG CT C C AG CT C CAAAAC C AGAG CAAC C AG CT AAG C CGG AGAAAC C AGCTGAAG 
AGCCTACTCAACCAGAAAAACCAGCCACTCCAAAAACAGGCTGGAAACAAGAAAACGGTATGTGGTATTTCTACAAT 
ACTGATGGTTCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCGCTATGGC 
AACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCA 
AAGTATCAGATAAATGGTACTATGTCAACAGCAATGGCGCTATGGCGACAGGCTGGCTCCAATACAATGGCTCATGG 
TACTACCTCAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATACAACGGTTCATGGTATTACCTCAACGCTAA 
TGGTGATATGGCGACAGGATGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTGCTATGGCTACAG 
GTTGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAAGATGGA 
GATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTA 
TGTCAATGGCTTAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATGGGTTT 
AA 

MNKKKMILTSLASVAILGAGFVTSQPTFVRAEESPQ 

DQKRTEEKARKEAEASQKL1TOVALVVQNAYKEYR 

RAVWPEPNALAETKKKAEEAKAEEKVAKRKYDYATL 

KKLLAGADPDDGTEVIEAKLKKGEAELNAKQAE^KKQTELEKLLDSLDPEGKTQDELDKEAEEAELDKKADELQNK 

VADLEKE I SNLE I LLGGADPEDDTAALQNKLAAKKAELAKKQTELEKLLDS LD PEGKTQDELDKEAEEAELDKKADE 

LQNKVADLEKEISNLEILLGGADSEDDTAALQNKLATKKAELEKTQKELDAALNELGPDGDEEETPAPAPQPEQPAP 

APKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAPAPKPEQPAKPEKPAEEPTQPEKPATPKTGWKQENGMWYFYN 

TDGSMAIGWLQNNGSWYYLNANGAMATGWVKDGDT 

YYLNANGDMATGWLQYNGSWYYLNANGDMATGWAKVNGSW 

DTWYYLEASGAMKASQWFKVSDKWYYVNGLGALAVNTTVDGYKVNANGEWVZ 

ID302 

ATGTTTGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTCGTAAATTTAGTGTTGGAGTAGCTAGTGTAGTTGT 
TGCCAGTCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAACGAGGGAGCTACCCAAGTACCCACTTCTTCTAATA 
GGGCAAATGAAAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGAGATAAGGCAAGGAAAGAG 
GTCGAGGAATATGTAAAAAAAATAGTGGGTGAGAGCTATGCAAAATCAACTAAAAAGCGACATACAATTACTGTAGC 
TCTAGTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTTGAATCAACCTCAGAAAGCCAACTAC 
AGATACTGATGATGGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCA 
AGTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGGAGAAAAGGT 
AGCAG AAG CT AAG AAGAAGGTTG AAG AAG C TG AGAAAAAAG C C AAGG AT C AAAAAGAAGAAGAT CGT CGT AACT AC C 
CAACCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAAAAAAGCGGAGCTTGAACTA 
GTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAACAAGC 
TGAGGCTACAAGGTTAAAAAAAATCAAGACAGATCGTGAAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTA 
AAGAGCAAGGTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAACACCTGATAAAAAAGAAAAT 
GATGCGAAGTCTTCAGATTCTAGCGTAGGTGAAGAAACTCTTCCAAGCCCATCCCTGAAACCAGAAAAAAAGGTAGC 
AGAAG C T G AG AAGAAGG T T GAAG AAG C T AAGAAAAAAGC C G AGG AT C AAAAAGAAGAAGAT CGC CGT AACT AC C C AA 
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CCAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGGAAGTTAAAAAAGCGGAGCTTGAACTAGTA 
AAAG AGG AAG CTAAGGAAC C T CGAAACG AGG AAAAAGT T AAGC AAG C AAAAG CGGAAGTT GAGAGT AAAAAAG CTGA 
GG C T AC AAGGTTAG AAAAAAT C AAGACAG AT C GT AAAAAAGC AG AAG AAG AAG CT AAACGAAAAGC AGC AG AAGAAG 
ATAAAGTTAAAGAAAAACCAGCTGAACAACCACAACCAGCGCCGGCTCCAAAAGCAGAAAAACCAGCTCCAGCTCCA 
AAAC CAG AG AAT C CAG CTGAAC AAC C AAAAGCAG AAAAAC C AG C TG AT CAAC AAG CTGAAGAAGACT ATG CT CGT AG 
AT C AG AAGAAG AATATAATCG CTTG AC T CAAC AG C AAC CG C CAAAAACTGAAAAAC C AG C AC AAC CAT CTACTCCAA 
AAACAGGCTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGACAGGATGGCTCCAA 
AACAATGGCTCATGGTACTACCTCAACAGCAATGGCGCTATGGCGACAGGATGGCTCCAAAACAATGGTTCATGGTA 
CTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTACTACCTAAACGCTAATG 
GTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTTCAATGGCGACAGGA 
TGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGATATGGCGACAGGTTGGGTGAAAGATGGAGA 
TACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTATG 
TCAATGGCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATGGAGTCAATGCCAATGGTGAATGGGTAAAC 
TAA 

MFAS KS ERKVHYS I RKFS VGVAS VWAS LVMGS WHATENEGATQVPTS SNRANESQAEQGEQP KKLDS ERDKARKE 
VEEYVKKIVGESYAKSTKKRHTITVALVNELNNI 

SSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKDQKEEDRRNYPTITYKTLELEIAESDVEVKKAELEL 

VKVKANEPRDEQKIKQAEAEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEQGKPKGRAKRGVPGELATPDKKEN 

DAK5SDSSVGEETLPSPSLKPEKKVAEAEKKVEEAKKKAEDQKEEDRRNYPTNTYKTLELEIAESDVEVKKAELELV 

KEEAKEPRNEEKVKQAKAEVESKKAEATRLEKIKTDRKKAEEEAKRKAAEEDK^/KEKPAEQPQPAPAPKAEKPAPAP 

KPENPAEQPKAEKPADQQAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGMWYFYNTDGSMATGWLQ 

NNGSWYYLNSNGAMATGWLQNNGSWYYLNANGSMATGWLQNNGSWYYLNANGSMATGWLQTO 

WLQYNGSWYYLNANGDI^TGWKDGDTWYYLEASGAMKASQ 

Z 

ID303 

ATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGAAAAAGTTGTTGTTCCTGAGCAATCATCTATTCCTTC 
GTATCCTGTATCTGTTACATCTAACCAAGGAACAGATGTAGCAGTAGAACCAGCTAAAGCAGTTGCTCCAACAACAG 
ACTGGAAACAAGAAAATGGTATGTGGTATTTTTATAATACTGATGGTTCCATGGCAACAGGTTGGGTACAAGTTAAT 
AGTTCATGGTACTACCTCAACAGCAACGGTTCTATGAAAGTCAATCAATGGTTCCAAGTTGGTGGTAAATGGTATTA 
TGTAAATACATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATGGGTGC 
GTTAA 

^KRRIRRGTREPEKVWPEQSSIPSYPVSVTSNQGTDVAVEPAKAVAPTTDWKQENGMWYFYNTDGSMATGWVQVN 
SSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELAVNTSIDGYRVNDNGEWVRZ 

ID3 0 4 

CTGAATACAAGTTTTGTTCATGCTGCTGATGGGATTCAATATGTCAGAGATGATACTAGAGATAAAGAAGAGGGAAT 
AGAGTATGATGACGCTGACAATGGGGATATTATTGTAAAAGTAGCGACTAAACCTAAGGTAGTAACCAAGAAAATTT 
CAAGTACGCGAATTCGTTATGAAAAAGATGAAACAAAAGACCGTAGTGAAAATCCTGTTACAATTGATGGAGAGGAT 
GGCTATGTAACTACGACAAGGACCTACGATGTTAATCCAGAGACTGGTTATGTTACCGAACAGGTTACTGTTGATAG 
AAAAGAAGCCACGGATACAGTTATCAAAGTTCCAGCTAAAAGCAAGGTTGAAGAAGTTCTTGTTCCATTTGCTACTA 
AATATGAAGCAGACAATGACCTTTCTGCAGGACAGGAGCAAGAGATTACTCTAGGAAAGAATGGGAAAACAGTTACA 
ACGATAACTTATAATGTAGATGGAAAGAGTGGACAAGTAACTGAGAGTACTTTAAGTCAAAAAAAAGACTCTCAAAC 
AAGAGTTGTTAAAAAAAGAACCAAGCCCCAAGTTCTTGTCCAAGAAATTCCAATCGAAACAGAATATCTCGATGGCC 
CAACTCTTGATAAAAGTCAAGAAGTAGAAGAAGTAGGAGAAATTGGTAAATTACTCTTACTACAATCTATACTGTAG 

LNTS FVHAADGIQYVRDDTRDKEEGIEYDDADNGDI IVKVATKPKWTKKISSTRIRYEKDETKDRSENPVTIDGED 
GYVTTTRTYDVNPETGYVTEQVTVDRKEATDWIKV 

TITYNVDGKSGQVTESTLSQKKDSQTRWKKRTKPQVLVQEIPIETEYLDGPTLDKSQEVEEVGEIGKLLLLQSILZ 



ID3 0 5 

ATGAAGCTTTTGAAAAAAATGATGCAAATCGCACTAGCCACATTTTTCTTCGGTTTGTTAGCGACAAATACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAATGGTAGAACCTACTACAAAAAGGGGGATCTAAAAGAAA 
CCTACTGGAGAGTGATAGATGGGAAGTACTATTATTTTGATCCTTTATCCGGAGAGATGGTTGTCGGCTGGCAATAT 
ATACCTGCTCCACACAAGGGGGTTACGATTGGTCCTTCTCCAAGAATAGAGATTGCTCTTAGACCAGATTGGTTTTA 
TTTTGGTCAAGATGGTGTATTACAAGAATTTGTTGGCAAGCAAGTTTTAGAAGCAAAAACTGCTACGAATACCAACA 
AACATCATGGGGAAGAATATGATAGCCAAGCAGAGAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCATACT 
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TTAAAAACTGGTTGGATTTATGAAGAGGGTCATTGGTATTATTTACAGAAGGATGGTGGCTTTGATTCGCGCATCAA 
CAGATTGACGGTTGGAGAGCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAG 
CAGCTCCATGGTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGGTAATAGATGGTAC 
TACCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAGATGCTGAAAA 
5 TGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTG 
GTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATATGAAAACAGGCTGGTTCCAAGTC 
AATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAATACCACAGTAGGTGGTTACTACTTAAACTA 
TAATGGTGAATGGGTTAAGTAA 

10 MKLLKKMMQIALATFFFGLLATNTVFADDSEGW 

IPAPHKGVTIGPSPRIEIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEDQRSYHT 
LKTGWIYEEGHWYYLQKDGGFDSRINRLTVGELARGWVKDYPLTYDEEKLKAAPWYYLNPATGIMQTGWQYLGNRWY 
YLHSSGAMATGWYKEGSTWYYLDAENGDMRTGWQNLGNKWYYLRSSGAMATGWYQESSTWYYLNASNGDMKTGWFQV 
NGNWYYAYDSGALAVNTTVGGYYLNYNGEWVKZ 

15 

ID306 

T TGG C TGGT AG AT ATGGTT C TG CTGTT C AGTGT AC AGAAGT G ACTG C CT C AAAC CTTT C AAC AGT T AAAACT AAAGC 
TACGGTTGTAGAAAAACCACTGAAAGATTTTAGAGCGTCTACGTCTGATCAGTCTGGTTGGGTGGAATCTAATGGTA 
AATGGTATTTCTATGAGTCTGGTGATGTGAAGACAGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGAC 
20 TTAGGTGTCATGCAGACTGGATTTGTAAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTAC 
AGGCTGGGGAACAGATGGTAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAA 
ATGGCACTTGGTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTAT 
GCCTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGAATGGGT 
AAACTAG 



25 



I^GRYGSAVQCTEVTASNLSTVKTKATWEKPLKDFRASTSDQSGWESNGK^YFYESGDVKTGWVKTDGKWYYLND 
LGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFDGSGAMKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYY 
A YGS G ALAVS TTT PDG YRVNGNGE WVN Z 



30 ID307 

ATGAAAATTTTGAAAAAAACTATGCAAGTTGGACTGACAGTATTTTTCTTTGGTTTGCTAGGGACCAGTACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAACGGAAGAACCTACTACAAAAAGGGGGACCTCAAAGAAA 
CCTACTGGCGAGTGATTGATGGTAAGTACTATTATTTTGATTCTCTATCTGGAGAGATGGTTGTCGGCTGGCAATAT 
ATCCCGTTTCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTATCAGATTAGAAGGTTTTCCAAAGTCAGA 

35 GTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGTTGGTTGGAAAACATTAGAGATTAAAACTAAAGACA 
GTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGATTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTAC 
TTTAATCAAAATCATTCTTTAGAGACAGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGCTAAGACGGAAAT 
TAATGGAGAAAACTACCTTGGTGGTGAAAGACGTGCGGGGTGGATAAACGATGATTCGACTTGGTACTACCTAGATC 
CAACAACTGGTATTATGCAAACAGGTTGGCAATATCTAGGTAATAAGTGGTACTACCTCCGTTCCTCAGGAGCAATG 

40 GCCACTGGCTGGTATCAGGAAGGTACCACTTGGTATTATTTAGACCACCCAAATGGCGATATGAAAACAGGTTGGCA 
AAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCAAGATGGTTCAACTT 
GGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAATGGCAACTGGTACTATGCTTAT 
AGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGTCAACTATAATGGCGAATGGGTTCGGTAA 

45 MKILKKTMQVGLWFFFGLLGTSTVFADDSEGWQFVQENGRTYYKKGDLKETYWRVIDGKYYYFDSLSGEMVVGWQY 
IPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKREDSEDKEEKRYYTNYY 
FNQNHSLETGWLYDQSbmYYI^KTEINGENYLGGERRAGWINDDSTWYYLDPTTGIMQTGWQYLGNKWYYLRSSGAM 
ATGWYQEGTTWYYLDHPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQDGSTWYYLNAGNGDMKTGWFQVNGNWYYAY 
SSGALAVNTTVDGYSVNYNGEWVRZ 

50 

ID3 0 8 

ATGAAAAAGAAATTAACTAGTTTAGCACTTGTAGGCGCTTTTTTAGGTTTGTCATGGTATGGGAATGTTCAGGCTCA 
AGAAAGTTCAGGAAATAAAATCCACTTTATCAATGTTCAAGAAGGTGGCAGTGATGCGATTATTCTTGAAAGCAATG 

55 GACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGATGGAAGTGATTCTCGCTATCCATGGAGAGAA 
GGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCGTGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAAAACT 
TGATTTTATTTTGGTGACCCATACCCACAGTGATCATATTGGAAATGTTGATGAATTACTGTCTACCTATCCAGTTG 
ACCGAGTCTATCTTAAGAAATATAGTGATAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTAT 
GATAAGGTTTTACAGACTGCTGCAGAAAAAGGTGTTTCAGTTATTCAAAATATCACACAAGGGGATGCTCATTTTCA 

60 GT TTGGG G AC ATGG AT AT T C AG CT C T AT AAT T ATGAAAAT G AAAC TGAT T CAT C GGGTG AAT T AAAG AAAATTTGG G 
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ATGACAATTCCAATTCCTTGATTAGCGTGGTGAAAGTCAATGGCAAGAAAATTTACCTTGGGGGCGATTTAGATAAT 
GTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTTTAATCATCACCATGATAC 
CAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGATTGTTCAAACTTCGGATAGTCTACCTT 
GGAAAAATGGTGTT GAT AGTG AGT ATGT TAAT T GG CT C AAAGAACGAGGAAT TGAG AG AAT CAACG CAG C C AGCAAA 
GACTATGATGCAACAGTTTTTGATATTCGAAAAGACGGTTTTGTCAATATTTCAACATCCTACAAGCCGATTCCAAG 
TTTTCAAGCTGGTTGGCATAAGAGTGCATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACAGGAGAGTATGCTG 
TCGGTTGGAATGAAATCGAAGGTGAATGGTATTACTTTAACCAAACGGGTATCTTGTTACAGAATCAATGGAAAAAA 
TGGAACAATCATTGGTTCTATTTGACAGACTCTGGTGCTTCTGCTAAAAATTGGAAGAAAATCGCTGGAATCTGGTA 
TTATTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTATTATTTGGATGTTGATG 
GTTCTATGAAGACAGGATGGCTTCAATATATGGGGCAATGGTATTACTTTGCTCCATCAGGGGAAATGAAAATGGGC 
TGGGTAAAAGATAAAGAAACCTGGTACTATATGGATTCTACTGGTGTCATGAAGACAGGTGAGATAGAAGTTGCTGG 
TCAACATTATTATCTGGAAGATTCAGGAGCTATGAAGCAAGGCTGGCATAAAAAGGCAAATGATTGGTATTTCTACA 
AGACAGACGGTTCACGAGCTGTGGGTTGGATCAAGGACAAGGATAAATGGTACTTCTTGAAAGAAAATGGTCAATTA 
CTTGTGAACGGTAAGACACCAGAAGGTTATACTGTGGATTCAAGTGGTGCCTGGTTAGTGGATGTTTCGATCGAGAA 
ATCTG CTACAATTAAAACTACAAGTCATTCAGAAATAAAAGAAT CCAAAGAAGTAGTGAAAAAGGATCTTGAAAATA 
AAGAAACGAGTCAACATGAAAGTGTTACAAATTTTTCAACTAGTCAAGATTTGACATCCTCAACTTCACAAAGCTCT 
GAAACGAGTGTAAACAAATCGGAATCAGAACAGTAG 

MKKKLTSLALVGAFLGLSWYGNVQAQESSGNKIHFINVQEGGSDAIILESNGHFAMVDTGEDYDFPDGSDSRYPWRE 
GIETSYKHVLTDRVFRRLKELGVQKLDFILVTHTHSDHIGNVDELLSTYPVDRVYLKKYSDSRITNSERLWDNLYGY 
DKVLQTAAEKGVSVIQNITQGDAHFQFGDI^IQLYNYENETDSSGELKKIWDDNSNSLISWKVNGKKIYLGGDLDN 
VHGAEDKYGPLIGKVDLMKFNHHHDTNKSNTKDFIKNLSP^ 

D YDATVFD I RKDGFVNI S TSYKP I PS FQAGWHKS AYGNWWYQAPDSTGE YAVGWNE I EGEWYYFNQTGI LLQNQWKK 

WNNHWFYLTDSGASAKNWKKIAGIWYYFNKENQM 

WVKDKETWYYMDSTGVMKTGEIEVAGQHYYLEDSGAMKQGWHK^ 

LVNGKTPEGYTVDSSGAWLVDVSIEKSATIKTTSHSEIKESKEVVKKDLENKETSQHESVTNFSTSQDLTSSTSQSS 
ETSVNKSESEQZ 

ID309 

ATGGAAATTAATGTGAGTAAATTAAGAACAGATTTGCCTCAAGTCGGCGTGCAACCATATAGGCAAGTACACGCACA 
CTCAACTGGGAATCCGCATTCAACCGTACAGAATGAAGCGGATTATCACTGGCGGAAAGACCCAGAATTAGGTTTTT 
TCTCGCACATTGTTGGGAACGGTTGCATCATGCAGGTAGGACCTGTTGATAATGGTGCCTGGGACGTTGGGGGCGGT 
TGGAATGCTGAGACCTATGCAGCGGTTGAACTGATTGAAAGCCATTCAACCAAAGAAGAGTTCATGACGGACTACCG 
CCTTTATATCGAACTCTTACGCAATCTAGCAGATGAAGCAGGTTTGCCGAAAACGCTTGATACAGGGAGTTTAGCTG 
GAATTAAAACGCACGAGTATTGCACGAATAACCAACCAAACAACCACTCAGACCACGTTGACCCTTATCCATATCTT 
GCTAAATGGGGCATTAGCCGTGAGCAGTTTAAGCATGATATTGAGAACGGCTTGACGATTGAAACAGGCTGGCAGAA 
GAATGACACTGGCTACTGGTACGTACATTCAGACGGCTCTTATCCAAAAGACAAGTTTGAGAAAATCAATGGCACTT 
GGTACTACTTTGACAGTTCAGGCTATATGCTTGCAGACCGCTGGAGGAAGCACACAGACGGCAACTGGTACTGGTTC 
GACAACTCAGGCGAAATGGCTACAGGCTGGAAGAAAATCGCTGATAAGTGGTACTATTTCAACGAAGAAGGTGCCAT 
G AAGAC AGGC TGGGT C AAGT AC AAGGAC AC TTGGT AC T ACT T AG ACG CT AAAGAAGGCG C C ATGGT AT C AAATG C C T 
TTATCCAGTCAGCGGACGGAACAGGCTGGTACTACCTCAAACCAGACGGAACACTGGCAGACAAGCCAGAATTCACA 
GT AG AG CC AG ATGGCTTG AT TACAGTAAAATAA 

MEINVSKLRTDLPQVGVQPYRQVHAHSTGNPHSTVQNEADYHWRKDPELGFFSHIVGNGCIMQVGPVDNGAWDVGGG 

WNAETYAAVELIESHSTKEEFMTDYRLYIELLRNLADEAGLPKTLDTGSLAGIKTHEYCTNNQPNNHSDHVDPYPYL 

AKWGISREQFKHDIENGLTIETGWQKNDTGYWYVHSDGSYPPODKFEKINGTWYYFDSSGYMLADRWRKHTDG 

DNSGEMATGWKKIADKWYYFNEEGAMKTGOTKYKDTWYYLDAKEGAMVSNAFIQSADGTGWYYLKPDGTIJUDKPEFT 

VEPDGLITVKZ 

ID310 

ATGGGCACAACAGGATTTACAATAATTGACTTAATTATCTTGATTGTTTATTTACTTGCGGTGTTGGTTGCAGGTAT 
CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATGGTTCGGTTCCTTGGTATGTTACTTCGG 
TATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCTCTTATGCAGGTAGCTGGATT 
TTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCTTACCTATCTTTGCACGGAT 
AGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAAGCACTTCGTATTATTTCAGCACTCTTGT 
TTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGCTGGTTTATCAGTATTGACAGGAATT 
GACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTATACTGGTGGTCTAAAATCCGTATT 
ATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTATTTGTACTGATTGCTAATATTA 
AAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCTTGCTGCAAATGAAAAACTTTTCGATCCT 
AACTTGCTTTCAAACTCCATCTTTTTAATTGTGATGGGTTCAGGCTTTACAATCTTGTCTTCCTATGCTTCATCTCA 
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AGATTTGGTTCAACGTTTTACTACAACACAAAA.TATTAAGAAACTTAATAAGATGTTGTTCACAAACGGTGTTTTGT 
CACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCTTGTACGTATTCTATCAAGTACAAAATGCAGATAGT 
G CAGC TAG C AATAT C C CT CAAG AC C AAAT CT T T ATGT ACTTTATTG CAT AC CAGT T AC C AGTAGGT AT CAC AGGT TT 
GATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACTCTGTTGCAACTTCATGGACAT 
5 TGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTATCTCTAGCA 
GTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCTGCATACGAATGGTTCAATAG 
TTTCATGGGACTTGTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGATTTGTTTCTAAAAAAGCAAATAAACAAG 
GTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCTCCAACAGCTGTTAGC 
TACTGGGCATATTCATTGATTTCAATCTCTGTATCAGTAGTTTCAGGTTATATTGTATCTGTTCTTACTGGAAATAA 
10 AGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGAAGTTCGTCACT 
AA 

MGTTGFTIIDLIILIVYLLAVLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSIFATMLSPISFLGI.AGSSYAGSWI 
LWFAQLGMWAI PLTIRFILPI FARIDIDTAYDYLDKRFNSKALRI ISALLFI I YQLGRMS I IMYLPSAGLSVLTGI 
15 DINILIILMGWAIVYSYTGGLKSVLWTDFIQGVILISGWLAL 

NLLSNS I FLI VMGSGFTILSSYASSQDLVQRFTTTQNI KKLNKMLFTNGVLSLATATVFYLIGTGLYVFYQVQNADS 
AASNIPQDQIFMYFIAYQLPVGITGLILAAIYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSLA 
VGLFSIGVSIVMAHSDIKSAYEWFNSFMGLVLGLLGGVFILGFVSKKANKQGAYAALIVSTIVMVFIKYFLPPTAVS 
YWAYS L I S I SVSWS GY I VS VLTGNKVS AP KYTT IHDITEI KADS SWEVRH Z 

20 

ID311 

ATGAAAATTAATAAAAAATATCTAGCAGGTTCAGTGGCAGTCCTTGCCCTAAGTGTTTGTTCCTATGAGCTTGGTCG 
TCACCAAGCTGGTCAGGATAAGAAAGAGTCTAATCGAGTTGCTTATATAGATGGTGATCAGGCTGGTCAAAAGGCAG 
AAAAC TTGACAC C AGAT GAAGT C AGT AAGAGGGAGGGG AT C AACG C CG AAC AAAT CGT CAT CAAG ATT ACGGAT C AA 

25 GGTTATGTGACCTCTCATGGAGACCATTATCATTACTATAATGGCAAGGTCCCTTATGATGCCATCATCAGTGAAGA 
GCTCCTCATGAAAGATCCGAATTATCAGTTGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGTCATCA 
AGGTAGACGGAAAATACTATGTTTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAA 
CGTC AGAAG CAGGAACG CAGT CAT AAT CAC GGGT C AGG AGC TAACGAT C ATG C AGT AG CTG C AG C C AGAG C C C AAGG 
ACGCTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATATCGTTC 

30 CTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGCAGAAGCCTATTGG 
AATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGCTCAACCAAGATTGTCAGAGAA 
CCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAACATTTCAAGCCTTTTACGTGAATTGTATG 
CTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCTTATTTTCGACCCAGCGCAAATCACAAGTCGAACCGCC 
AGAGGTGTAGCTGTCCCTCATGGTAACCATTACCACTTTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAAT 

35 TGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCACAAT 
CGACTCCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAATCCTCAACCAGCTCCAAGCAATCCAATTGATGAGAAA 
TTGGTCAAAGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC 
C AAGG AT CT TT CAGC AGAAAC AG CAG C AGGC ATTG AT AG C AAACTGG C CAAG C AGGAAAGTTT AT CT C AT AAGC TAG 
GAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTTACTAGCAAGAATTCAC 

40 CAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAACCTGTTGGAACGACTCAAGGATGT 
CCCAAGTGATAAAGTCAAGTTAGTGGATGATATTCTTGCCTTCTTAGCTCCGATTCGTCATCCAGAACGTTTAGGAA 
AAC C AAATG CG C AAATTAC CT AC AC TGATGATGAG AT TCAAGT AGC CAAGTTGGC AGG CAAGT AC ACAAC AGAAG AC 
GGTTATATCTTTGATCCTCGTGATATAACCAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCA 
CTGGATTAAAAAAGATAGTTTGTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCC 

45 CTC CT T CG AC AGAC CAT C AGGAT T C AGG AAAT ACTGAGGCAAAAGGAG C AGAAGCT AT CT AC AAC CGCGTG AAAG CA 

GCTAAGAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTTAATCAT 
ACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACCTAAGGGGTATA 
CTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCCGCATTCAGATAATGGTTTT 
GGTAACGCTAGCGACCATGTTCAAAGAAACAAAAATGGTCAAGCTGATACCAATCAAACGGAAAAACCAAGCGAGGA 

50 G AAAC C TCAGAC AGAAAAAC CTGAGG AAG AAAC C C CT CG AG AAGAGAAAC CG C AAAG CGAGAAAC C AG AGT CT C CAA 

AACCAACAGAGGAAC C AGAAG AAT C AC CAGAGGAATC AGAAG AAC CTC AGGT CGAGACTGAAAAGGTTGAAGAAAAA 
CTGAGAGAGGCTGAAGATTTACTTGGAAAAATCCAGGATCCAATTATCAAGTCCAATGCCAAAGAGACTCTCACAGG 
ATTAAAAAATAATTTACTATTTGGCACCCAGGACAACAATACTATTATGGCAGAAGCTGAAAAACTATTGGCTTTAT 
TAAAGGAGAGTAAGTAA 

55 

MKINKKYLAGSVAVLALSVCSYELGRHQAGQDKKESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQ 
GYVTSHGDHYHY YNGKVP YDAI I S EELLMKDPNYQLKDSD I VNE I KGGYV I KVDGKYYVYLKDAAHADNI RTKE E I K 
RQKQERSHNHGSGANDHAVAAARAQGRYTTDDGYI FNASDI IEDTGDAYIVPHGDHYHYI PKNELSASELAAAEAYW 
NGKQGSRPSSSSSYNANPAQPRLSENHNLTVTPTYHQNQGENISSLLRELYAKPLSERHVESDGLI FDPAQITSRTA 
60 RGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPQSTPEPSPSPQPAPNPQPAPSNPIDEK 
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LVKEAVRKVGDGYVFEENGVSRYIPAKDLSAETAAGIDSKLAKQESLSHKLGAKKTDLPSSDREFYNKAYDLLARIH 
QDLLDNKGRQVDFEALDNLLERLKDVPSDKVKLVD^ 

G Y I FD P RD I T S D E GD A YVT PHMTHS HW I KKD S L S E AE RAAAQ A Y AKE KGL T PPSTDHQDS GNT E AKGAE AI YNRVKA 
AKKVPLDRMPYNLQYTVEVKNGSLIIPHYDHYHNIKFEWFDEGLYEAPKGYTLEDLLATVKYYVEHPNERPHSDNGF 
5 GNASDHVQRNKNGQADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEESEEPQVETEKVEEK 
LREAEDLLGKIQDPIIKSNAKETLTGLKNNLLFGTQDNNTIMAEAEKLLALLKESKZ 

ID312 

ATGGAGGGATTGGTTAGAGTGCATTTATTGCCTGTATTTGGCGATTACAAGCTATCTAAACTTACTACGCCTATTCT 
10 TCAACAGCAAGTAAACAAATGGGCTGACAAGGCAAATAAAGGCGAAAAAGGGGCATTTGCTAACTACTCTTTGCTCC 
ATAACATGAATAAGCGTATTTTGAAATATGGCGTAGCTATCCAGGTAATACAATACAACCCAGCTAATGATGTCATC 
GTTCCACGCAAACAGCAAAAAGAAAAGGCTGCTGTCAAATACTTAGACAACAAAGAATTAAAACAGTTTCTTGATTA 
TTTAGATGCTCTGGATCAATCAAATTATGAGAACTTATTTGATGTTGTTCTGTATAAGACTTTATTGGCCACTGGTT 
GCCGTATTAGTGAGGCTCTGGCTCTTGAATGGTCTGATATTGACCTAGAAAGCGGTGTTATCAGCATCAATAAGACA 
15 CTAAACCGCTATCAGGAAATAAACTCACCTAAATCAAGCGCTGGTTATCGTGATATACCAATAGACAAAGCCACATT 
ACTTTTACTGAAACAATACAAAAACCGTCAACAAATTCAGTCTTGGAAATTAGGCCGATCTGAAACAGTTGTATTCT 
CTGTATTTACGGAGAAATATGCTTATGCTTGTAACTTACGCAAACGCCTAAATAAGCATTTTGATGCTGCTGGAGTA 
ACTAACGTATCATTTCATGGTTTCCGCCATACACATACTACTATGATGCTCTATGCTCAGGTTAGCCCGAAAGATGT 
T C AGT AT AG AT T AGG C C ACT C T AATTT AATGAT C AC TGAAAAT AC TT ACTGG CAT AC T AAC CAAGAG AATG C AAAAA 
20 AAGC CGTC T C AAATTATG AAAC AG CT AT C AAC AATT T ATAA 

MEGLVRVHLLPVFGDYKLS KLTTP I LQQQVNKWADKANKGE KGAFANYS LLHNMNKRI LKYGVAI QVIQ YNPANDVI 
VPRKQQKEKAAVKYLDNKELKQFLDYLDALDQSNYENLFDVVLYKTLLATGCRISEALALEWSDIDLESGVISINKT 
LNRYQEINSPKSSAGYRDIPIDKATLLLLKQYKI^RQQIQSWKLGRSETVVFSVFTEKYAYACNLRKRLNKHFDAAGV 
25 TNVSFHGFRHTHTTMMLYAQVSPKDVQYRLGHSNLMITENTYWHTNQENAKKAVSNYETAINNLZ 
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CLAIMS: 

1 . A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 2. 

5 

2. A Streptococcus pneumoniae protein or polypeptide having a sequence 
selected from those shown in table 4. 

3. A protein or polypeptide as claimed in claim 1 or claim 2 provided in 
10 substantially pure form. 

4. A protein or polypeptide which is substantially identical to one defined in any 
one of claims 1 to 3. 

15 5. A homologue or derivative of a protein or polypeptide as defined in any one of 
claims 1 to 4. 

6. An antigenic and/or immunogenic fragment of a protein or polypeptide as 
defined 

20 in Tables 2-4. 

7. A nucleic acid molecule comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

25 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 
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(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

5 (v) a sequence which codes for a homologue, derivative or fragment of a 

protein as defined in Table 1 . 

8. A nucleic acid molecule comprising or consisting of a sequence which is: 

10 (i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

(iii) a sequence which codes for the same protein or polypeptide, as those 
15 sequences of (i) or (ii); 

(iv) a sequence which is substantially identical with any of those of (i), (ii) 
and (iii); 

20 (v) a sequence which codes for a homologue, derivative or fragment of a 

protein as defined in Table 4. 

9. The use of a protein or polypeptide having a sequence selected from those 
shown in Tables 2-4, or homologues, derivatives and/or fragments thereof, as an 

25 immunogen and/or antigen. 

10. An immunogenic and/or antigenic composition comprising one or more 
proteins or polypeptides selected from those whose sequences are shown in Tables 2- 
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4, or homologues or derivatives thereof, and/or fragments of any of these. 

11. An immunogenic and/or antigenic composition as claimed in claim 10 which 
is a vaccine or is for use in a diagnostic assay. 

5 

12. A vaccine as claimed in claim 11 which comprises one or more additional 
components selected from excipients, diluents, adjuvants or the like. 

13. A vaccine composition comprising one or more nucleic acid sequences as 
10 defined in Tables 1, 3 or 4. 

14. A method for the detection/diagnosis of S. pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one protein or 
polypeptide as defined in Tables 2-4, or homologue, derivative or fragment thereof. 

15 

15. An antibody capable of binding to a protein or polypeptide as defined in 
Tables 2-4, or for a homologue, derivative or fragment thereof. 

16. An antibody as defined in claim 15 which is a monoclonal antibody. 

20 

17. A method for the detection/diagnosis of S. pneumoniae which comprises the step 
of bringing into contact a sample to be tested and at least one antibody as define din 
claim 15 or claim 16. 

25 18. A method for the detection/diagnosis of S. pneumoniae which comprises the 
step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as defined in claim 7 or claim 8. 
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19. A method of determining whether a protein or polypeptide as defined in 
Tables 2-4 represents a potential anti-microbial target which comprises inactivating 
said protein or polypeptide and determining whether S. pneumoniae is still viable in 
vitro or in vivo. 

20. The use of an agent capable of antagonising, inhibiting or otherwise 
interfering with the function or expression of a protein or polypeptide as defined in 
Tables 2-4 in the manufacture of a medicament for use in the treatment or 
prophylaxis of S. pneumoniae infection 
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Claim 20 relates to the use of an agent capable of antagonising, 
inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide as defined in Tables 2-4 in the manufacture of a 
medicament. Neither a true technical characterization is given for such 
an agent, nor is such an agent defined in the application. In 
consequence, the scope of said claim is ambiguous and vague, and its 
subject-matter is not suff ieciently disclosed and supported (Art. 5 and 6 
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No search can be carried out for such purely speculative claims whose 
wording is, in fact, a mere recitation of the result to be achieved. 
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molecule; method of determining whether said protein or 
polypeptide represents a potential anti -microbial target 
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Claim 20 relates to the use of an agent capable of antagonising, 
inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide as defined in Tables 2-4 in the manufacture of a 
medicament. Neither a true technical characterization is given for such 
an agent, nor is such an agent defined in the application. In 
consequence, the scope of said claim is ambiguous and vague, and its 
subject-matter is not suf f ieciently disclosed and supported (Art. 5 and 6 
PCT). 

No search can be carried out for such purely speculative claims whose 
wording is, in fact, a mere recitation of the result to be achieved. 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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1. Claims: in part: 1,3-7,9-19; all as far as applicable 
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to SEQ ID No 208, and vaccine comprising said nucleic acid; 
use of said protein or polypeptide as an immunogen and/or 
antigen; immunogenic and/or antigenic composition comprising 
said protein or polypeptide, and its use as a vaccine; 
antibody directed to said protein or polypeptide; method for 
the detection/diagnosis using either said 
protein/polypeptide, or said antibody, or said nucleic acid 
molecule; method of determining whether said protein or 
polypeptide represents a potential anti -microbial target 
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as invention 1 but limited to subject-matter relating SEQ ID 
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